Nur Sabrina Azmi, Azurah A Samah, Hairudin Abdul Majid, Zuraini Ali Shah, H. Hashim, Nuraina Syaza Azman, Ezzeddin Kamil Mohamed Hashim
{"title":"基于多组学数据的深度神经网络对肿瘤肉瘤的分类","authors":"Nur Sabrina Azmi, Azurah A Samah, Hairudin Abdul Majid, Zuraini Ali Shah, H. Hashim, Nuraina Syaza Azman, Ezzeddin Kamil Mohamed Hashim","doi":"10.11113/ijic.v12n1.360","DOIUrl":null,"url":null,"abstract":"The challenge in classifying cancer may lead to inaccurate classification of cancers, especially sarcoma cancer since it consists of rare types of cancer. It is hard for the clinician to confirm the patient's condition because an accurate diagnosis can only be made by the specialist pathology. Therefore, instead of a single omics is used to identify the disease marker, an approach of integrating these omics to represent multi-omics brings more advantages in detecting and presenting the phenotype of the cancers. Nowadays, the advancement of computational models especially deep learning offered promising approaches in solving high-level omics of data with faster processing speed. Hence, the purpose of this study is to classify cancer and non-cancerous patients using Stacked Denoising Autoencoder (SDAE) and One-dimensional Convolutional Neural Network (1D CNN) to evaluate which algorithm classifies better using high correlated multi-omics data. The study employed both computational models to fit multi-omics dataset. Sarcoma omics datasets used in this study was obtained from the Multi-Omics Cancer Benchmark TCGA Pre-processed Data of ACGT Ron Shamir Lab repository. From the results, the accuracy obtained for the SDAE was 50.93% and 52.78% for the 1D CNN. The result show 1D CNN model outperformed SDAE in classifying sarcoma cancer.","PeriodicalId":50314,"journal":{"name":"International Journal of Innovative Computing Information and Control","volume":"98 ","pages":""},"PeriodicalIF":1.3000,"publicationDate":"2022-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Classifying Sarcoma Cancer Using Deep Neural Networks Based on Multi-Omics Data\",\"authors\":\"Nur Sabrina Azmi, Azurah A Samah, Hairudin Abdul Majid, Zuraini Ali Shah, H. Hashim, Nuraina Syaza Azman, Ezzeddin Kamil Mohamed Hashim\",\"doi\":\"10.11113/ijic.v12n1.360\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The challenge in classifying cancer may lead to inaccurate classification of cancers, especially sarcoma cancer since it consists of rare types of cancer. It is hard for the clinician to confirm the patient's condition because an accurate diagnosis can only be made by the specialist pathology. Therefore, instead of a single omics is used to identify the disease marker, an approach of integrating these omics to represent multi-omics brings more advantages in detecting and presenting the phenotype of the cancers. Nowadays, the advancement of computational models especially deep learning offered promising approaches in solving high-level omics of data with faster processing speed. Hence, the purpose of this study is to classify cancer and non-cancerous patients using Stacked Denoising Autoencoder (SDAE) and One-dimensional Convolutional Neural Network (1D CNN) to evaluate which algorithm classifies better using high correlated multi-omics data. The study employed both computational models to fit multi-omics dataset. Sarcoma omics datasets used in this study was obtained from the Multi-Omics Cancer Benchmark TCGA Pre-processed Data of ACGT Ron Shamir Lab repository. From the results, the accuracy obtained for the SDAE was 50.93% and 52.78% for the 1D CNN. The result show 1D CNN model outperformed SDAE in classifying sarcoma cancer.\",\"PeriodicalId\":50314,\"journal\":{\"name\":\"International Journal of Innovative Computing Information and Control\",\"volume\":\"98 \",\"pages\":\"\"},\"PeriodicalIF\":1.3000,\"publicationDate\":\"2022-03-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Innovative Computing Information and Control\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.11113/ijic.v12n1.360\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Innovative Computing Information and Control","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.11113/ijic.v12n1.360","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
摘要
对癌症进行分类的挑战可能会导致癌症的分类不准确,尤其是肉瘤癌症,因为它由罕见的癌症类型组成。临床医生很难确认病人的病情,因为准确的诊断只能由专业病理学做出。因此,代替单一组学来识别疾病标志物,整合这些组学来代表多组学的方法在检测和呈现癌症表型方面更具优势。如今,计算模型特别是深度学习的进步为解决高水平数据组学提供了有前途的方法,处理速度更快。因此,本研究的目的是利用堆叠去噪自编码器(SDAE)和一维卷积神经网络(1D CNN)对癌症和非癌症患者进行分类,并利用高相关的多组学数据评估哪种算法分类效果更好。该研究采用了两种计算模型来拟合多组学数据集。本研究中使用的肉瘤组学数据集来自ACGT Ron Shamir实验室存储库的多组学癌症基准TCGA预处理数据。从结果来看,SDAE的准确率为50.93%,1D CNN的准确率为52.78%。结果表明,1D CNN模型在肉瘤癌分类上优于SDAE模型。
Classifying Sarcoma Cancer Using Deep Neural Networks Based on Multi-Omics Data
The challenge in classifying cancer may lead to inaccurate classification of cancers, especially sarcoma cancer since it consists of rare types of cancer. It is hard for the clinician to confirm the patient's condition because an accurate diagnosis can only be made by the specialist pathology. Therefore, instead of a single omics is used to identify the disease marker, an approach of integrating these omics to represent multi-omics brings more advantages in detecting and presenting the phenotype of the cancers. Nowadays, the advancement of computational models especially deep learning offered promising approaches in solving high-level omics of data with faster processing speed. Hence, the purpose of this study is to classify cancer and non-cancerous patients using Stacked Denoising Autoencoder (SDAE) and One-dimensional Convolutional Neural Network (1D CNN) to evaluate which algorithm classifies better using high correlated multi-omics data. The study employed both computational models to fit multi-omics dataset. Sarcoma omics datasets used in this study was obtained from the Multi-Omics Cancer Benchmark TCGA Pre-processed Data of ACGT Ron Shamir Lab repository. From the results, the accuracy obtained for the SDAE was 50.93% and 52.78% for the 1D CNN. The result show 1D CNN model outperformed SDAE in classifying sarcoma cancer.
期刊介绍:
The primary aim of the International Journal of Innovative Computing, Information and Control (IJICIC) is to publish high-quality papers of new developments and trends, novel techniques and approaches, innovative methodologies and technologies on the theory and applications of intelligent systems, information and control. The IJICIC is a peer-reviewed English language journal and is published bimonthly