Michael Hartung, Andreas Maier, Fernando Delgado-Chaves, Yuliya Burankova, Olga I. Isaeva, Fábio Malta de Sá Patroni, Daniel He, Casey Shannon, Katharina Kaufmann, Jens Lohmann, Alexey Savchik, Anne Hartebrodt, Zoe Chervontseva, Farzaneh Firoozbakht, Niklas Probul, Evgenia Zotova, Olga Tsoy, David B. Blumenthal, Martin Ester, Tanja Laske, Jan Baumbach, Olga Zolotareva
{"title":"UnPaSt:通过 omics 数据中的差异表达双簇对患者进行无监督分层","authors":"Michael Hartung, Andreas Maier, Fernando Delgado-Chaves, Yuliya Burankova, Olga I. Isaeva, Fábio Malta de Sá Patroni, Daniel He, Casey Shannon, Katharina Kaufmann, Jens Lohmann, Alexey Savchik, Anne Hartebrodt, Zoe Chervontseva, Farzaneh Firoozbakht, Niklas Probul, Evgenia Zotova, Olga Tsoy, David B. Blumenthal, Martin Ester, Tanja Laske, Jan Baumbach, Olga Zolotareva","doi":"arxiv-2408.00200","DOIUrl":null,"url":null,"abstract":"Most complex diseases, including cancer and non-malignant diseases like\nasthma, have distinct molecular subtypes that require distinct clinical\napproaches. However, existing computational patient stratification methods have\nbeen benchmarked almost exclusively on cancer omics data and only perform well\nwhen mutually exclusive subtypes can be characterized by many biomarkers. Here,\nwe contribute with a massive evaluation attempt, quantitatively exploring the\npower of 22 unsupervised patient stratification methods using both, simulated\nand real transcriptome data. From this experience, we developed UnPaSt\n(https://apps.cosy.bio/unpast/) optimizing unsupervised patient stratification,\nworking even with only a limited number of subtype-predictive biomarkers. We\nevaluated all 23 methods on real-world breast cancer and asthma transcriptomics\ndata. Although many methods reliably detected major breast cancer subtypes,\nonly few identified Th2-high asthma, and UnPaSt significantly outperformed its\nclosest competitors in both test datasets. Essentially, we showed that UnPaSt\ncan detect many biologically insightful and reproducible patterns in omic\ndatasets.","PeriodicalId":501070,"journal":{"name":"arXiv - QuanBio - Genomics","volume":"43 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"UnPaSt: unsupervised patient stratification by differentially expressed biclusters in omics data\",\"authors\":\"Michael Hartung, Andreas Maier, Fernando Delgado-Chaves, Yuliya Burankova, Olga I. Isaeva, Fábio Malta de Sá Patroni, Daniel He, Casey Shannon, Katharina Kaufmann, Jens Lohmann, Alexey Savchik, Anne Hartebrodt, Zoe Chervontseva, Farzaneh Firoozbakht, Niklas Probul, Evgenia Zotova, Olga Tsoy, David B. Blumenthal, Martin Ester, Tanja Laske, Jan Baumbach, Olga Zolotareva\",\"doi\":\"arxiv-2408.00200\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Most complex diseases, including cancer and non-malignant diseases like\\nasthma, have distinct molecular subtypes that require distinct clinical\\napproaches. However, existing computational patient stratification methods have\\nbeen benchmarked almost exclusively on cancer omics data and only perform well\\nwhen mutually exclusive subtypes can be characterized by many biomarkers. Here,\\nwe contribute with a massive evaluation attempt, quantitatively exploring the\\npower of 22 unsupervised patient stratification methods using both, simulated\\nand real transcriptome data. From this experience, we developed UnPaSt\\n(https://apps.cosy.bio/unpast/) optimizing unsupervised patient stratification,\\nworking even with only a limited number of subtype-predictive biomarkers. We\\nevaluated all 23 methods on real-world breast cancer and asthma transcriptomics\\ndata. Although many methods reliably detected major breast cancer subtypes,\\nonly few identified Th2-high asthma, and UnPaSt significantly outperformed its\\nclosest competitors in both test datasets. Essentially, we showed that UnPaSt\\ncan detect many biologically insightful and reproducible patterns in omic\\ndatasets.\",\"PeriodicalId\":501070,\"journal\":{\"name\":\"arXiv - QuanBio - Genomics\",\"volume\":\"43 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-07-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - QuanBio - Genomics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2408.00200\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Genomics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.00200","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
UnPaSt: unsupervised patient stratification by differentially expressed biclusters in omics data
Most complex diseases, including cancer and non-malignant diseases like
asthma, have distinct molecular subtypes that require distinct clinical
approaches. However, existing computational patient stratification methods have
been benchmarked almost exclusively on cancer omics data and only perform well
when mutually exclusive subtypes can be characterized by many biomarkers. Here,
we contribute with a massive evaluation attempt, quantitatively exploring the
power of 22 unsupervised patient stratification methods using both, simulated
and real transcriptome data. From this experience, we developed UnPaSt
(https://apps.cosy.bio/unpast/) optimizing unsupervised patient stratification,
working even with only a limited number of subtype-predictive biomarkers. We
evaluated all 23 methods on real-world breast cancer and asthma transcriptomics
data. Although many methods reliably detected major breast cancer subtypes,
only few identified Th2-high asthma, and UnPaSt significantly outperformed its
closest competitors in both test datasets. Essentially, we showed that UnPaSt
can detect many biologically insightful and reproducible patterns in omic
datasets.