Deep generative learning helps extract ecological meaning from the complex three dimensional shapes of bird bills

Authors: Russell Dinnage, Marian Kleineberg
Journal: Authorea (Authorea), published 2023-04-28
DOI: 10.22541/au.168269359.92620442/v1 (https://doi.org/10.22541/au.168269359.92620442/v1)
Citations: 0
Abstract
Data on the three-dimensional shape of organismal morphology are becoming increasingly common. The availability of these data forms part of a new revolution in high-throughput phenomics that promises to help us understand the ecological and evolutionary processes that influence phenotypes at unprecedented scales. However, to realize the potential of this revolution we need new data-analysis tools that can handle the complexity and heterogeneity of large-scale phenotypic data such as three-dimensional shapes. In this study we explore the potential of techniques derived from artificial intelligence research to help organise and extract meaning from complex 3D data. Specifically, we train a deep representational learning method known as DeepSDF, a type of deep generative model, on a dataset of 3D scans of the bills of 2,020 bird species. The model is designed to learn a continuous vector representation of 3D shapes, along with a ‘decoder’ function that transforms points in this vector space back into the original 3D morphological space. We find that the latent vector representations learned by this approach have morphological coherence, in the sense that movement along various directions in the latent space produces smooth, continuous transformations in 3D morphological space, and that particular directions are associated with discernible morphological meaning (such as elongation or flattening). The learned latent vectors also carry ecological meaning, as shown by their ability to predict, with a high degree of accuracy, the trophic niche of the bird each bill belongs to. Unlike existing 3D morphometric techniques, this method requires very little human supervision for tasks such as landmark placement, increasing its accessibility to labs with fewer labour resources, and it makes fewer strong assumptions than alternative dimension-reduction techniques such as PCA.
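The DeepSDF setup described above can be sketched in a few lines. This is an illustrative toy only, not the authors' trained model: the weights are random untrained stand-ins, the network is far smaller than a real DeepSDF decoder, and the latent codes are hypothetical. The key idea is that a decoder f(z, x) maps a latent shape code z plus a 3D query point x to a signed distance, so a bill surface is the zero level set {x : f(z, x) = 0}, and walking a line between two latent codes changes the decoded field smoothly.

```python
import numpy as np

# Minimal DeepSDF-style decoder sketch (illustrative only: random
# untrained weights stand in for learned parameters, and one hidden
# layer replaces the deep MLP used in practice).

rng = np.random.default_rng(0)
LATENT_DIM, HIDDEN = 8, 32

W1 = rng.normal(size=(LATENT_DIM + 3, HIDDEN)) * 0.5   # (z, x) -> hidden
b1 = np.zeros(HIDDEN)
W2 = rng.normal(size=(HIDDEN, 1)) * 0.5                # hidden -> SDF value
b2 = np.zeros(1)

def decode_sdf(z, points):
    """Evaluate f(z, x) at an (N, 3) array of 3D query points."""
    inp = np.concatenate([np.tile(z, (len(points), 1)), points], axis=1)
    h = np.tanh(inp @ W1 + b1)        # single hidden layer for brevity
    return (h @ W2 + b2).ravel()      # one signed distance per point

# Two hypothetical bill shape codes; interpolating between them in
# latent space morphs the SDF field (and so the implied surface)
# continuously -- the 'morphological coherence' noted in the abstract.
z_a = rng.normal(size=LATENT_DIM)
z_b = rng.normal(size=LATENT_DIM)
points = rng.uniform(-1.0, 1.0, size=(100, 3))

for t in (0.0, 0.5, 1.0):
    z_t = (1.0 - t) * z_a + t * z_b   # linear interpolation in latent space
    sdf = decode_sdf(z_t, points)
    assert sdf.shape == (100,)
```

Because the decoder is a smooth function of z, any path through the latent space yields a smooth family of shapes, which is what makes directions in that space interpretable as morphological axes (elongation, flattening, and so on).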
The computational requirements for training the model, while substantial, are within reasonable reach of most researchers: a ~2,000-shape model takes just over 2 days to train on a single current-generation consumer-level GPU. Once trained, the model can make 3D morphology predictions from latent vectors at very low computational cost. Lastly, we present an in-development R package implementing the model, with the goal of making model training accessible to any researcher with access to data and GPU computing resources.
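The claim that prediction from latent vectors is computationally cheap can be illustrated as follows. With the decoder weights fixed, reconstructing a shape is just one batched forward pass over a grid of query points, with no optimisation loop involved. The decoder below is again an untrained stand-in with assumed sizes (8-dimensional codes, a 32^3 grid), not the authors' network.

```python
import numpy as np

# Sketch of cheap inference: decoding one latent shape code over a
# 32^3 query grid is a pair of matrix multiplies (untrained toy
# decoder; real models are deeper but the cost structure is the same).

rng = np.random.default_rng(1)
LATENT_DIM, HIDDEN = 8, 32
W1 = rng.normal(size=(LATENT_DIM + 3, HIDDEN)) * 0.5
W2 = rng.normal(size=(HIDDEN, 1)) * 0.5

# A 32^3 grid of query points covering the unit cube.
axis = np.linspace(-1.0, 1.0, 32)
grid = np.stack(np.meshgrid(axis, axis, axis, indexing="ij"), -1).reshape(-1, 3)

z = rng.normal(size=LATENT_DIM)                 # one latent shape code
inp = np.concatenate([np.tile(z, (len(grid), 1)), grid], axis=1)
sdf = (np.tanh(inp @ W1) @ W2).ravel()          # one forward pass

# Grid points near the zero level set approximate the decoded surface;
# a mesh extractor such as marching cubes would run on this field.
surface = grid[np.abs(sdf) < 0.05]
assert sdf.shape == (32 ** 3,)
```

In practice one would pass the resulting SDF grid to a marching-cubes implementation to obtain a triangle mesh, but the expensive step, evaluating the decoder, remains a single vectorised pass that runs comfortably on a CPU.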