Mridul Khurana, Arka Daw, M. Maruf, Josef C. Uyeda, Wasila Dahdul, Caleb Charpentier, Yasin Bakış, Henry L. Bart Jr., Paula M. Mabee, Hilmar Lapp, James P. Balhoff, Wei-Lun Chao, Charles Stewart, Tanya Berger-Wolf, Anuj Karpatne
{"title":"Hierarchical Conditioning of Diffusion Models Using Tree-of-Life for Studying Species Evolution","authors":"Mridul Khurana, Arka Daw, M. Maruf, Josef C. Uyeda, Wasila Dahdul, Caleb Charpentier, Yasin Bakış, Henry L. Bart Jr., Paula M. Mabee, Hilmar Lapp, James P. Balhoff, Wei-Lun Chao, Charles Stewart, Tanya Berger-Wolf, Anuj Karpatne","doi":"arxiv-2408.00160","DOIUrl":null,"url":null,"abstract":"A central problem in biology is to understand how organisms evolve and adapt\nto their environment by acquiring variations in the observable characteristics\nor traits of species across the tree of life. With the growing availability of\nlarge-scale image repositories in biology and recent advances in generative\nmodeling, there is an opportunity to accelerate the discovery of evolutionary\ntraits automatically from images. Toward this goal, we introduce\nPhylo-Diffusion, a novel framework for conditioning diffusion models with\nphylogenetic knowledge represented in the form of HIERarchical Embeddings\n(HIER-Embeds). We also propose two new experiments for perturbing the embedding\nspace of Phylo-Diffusion: trait masking and trait swapping, inspired by\ncounterpart experiments of gene knockout and gene editing/swapping. Our work\nrepresents a novel methodological advance in generative modeling to structure\nthe embedding space of diffusion models using tree-based knowledge. Our work\nalso opens a new chapter of research in evolutionary biology by using\ngenerative models to visualize evolutionary changes directly from images. We\nempirically demonstrate the usefulness of Phylo-Diffusion in capturing\nmeaningful trait variations for fishes and birds, revealing novel insights\nabout the biological mechanisms of their evolution.","PeriodicalId":501044,"journal":{"name":"arXiv - QuanBio - Populations and Evolution","volume":"185 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Populations and Evolution","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.00160","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
A central problem in biology is to understand how organisms evolve and adapt
to their environment by acquiring variations in the observable characteristics
or traits of species across the tree of life. With the growing availability of
large-scale image repositories in biology and recent advances in generative
modeling, there is an opportunity to accelerate the discovery of evolutionary
traits automatically from images. Toward this goal, we introduce
Phylo-Diffusion, a novel framework for conditioning diffusion models with
phylogenetic knowledge represented in the form of HIERarchical Embeddings
(HIER-Embeds). We also propose two new experiments for perturbing the embedding
space of Phylo-Diffusion: trait masking and trait swapping, inspired by
counterpart experiments of gene knockout and gene editing/swapping. Our work
represents a novel methodological advance in generative modeling to structure
the embedding space of diffusion models using tree-based knowledge. Our work
also opens a new chapter of research in evolutionary biology by using
generative models to visualize evolutionary changes directly from images. We
empirically demonstrate the usefulness of Phylo-Diffusion in capturing
meaningful trait variations for fishes and birds, revealing novel insights
about the biological mechanisms of their evolution.