Fully invertible hyperbolic neural networks for segmenting large-scale surface and sub-surface data

Artificial Intelligence in Geosciences Pub Date : 2024-08-24 DOI:10.1016/j.aiig.2024.100087

Bas Peters , Eldad Haber , Keegan Lensink

{"title":"Fully invertible hyperbolic neural networks for segmenting large-scale surface and sub-surface data","authors":"Bas Peters , Eldad Haber , Keegan Lensink","doi":"10.1016/j.aiig.2024.100087","DOIUrl":null,"url":null,"abstract":"<div><p>The large spatial/temporal/frequency scale of geoscience and remote-sensing datasets causes memory issues when using convolutional neural networks for (sub-) surface data segmentation. Recently developed fully reversible or fully invertible networks can mostly avoid memory limitations by recomputing the states during the backward pass through the network. This results in a low and fixed memory requirement for storing network states, as opposed to the typical linear memory growth with network depth. This work focuses on a fully invertible network based on the telegraph equation. While reversibility saves the major amount of memory used in deep networks by the data, the convolutional kernels can take up most memory if fully invertible networks contain multiple invertible pooling/coarsening layers. We address the explosion of the number of convolutional kernels by combining fully invertible networks with layers that contain the convolutional kernels in a compressed form directly. A second challenge is that invertible networks output a tensor the same size as its input. This property prevents the straightforward application of invertible networks to applications that map between different input–output dimensions, need to map to outputs with more channels than present in the input data, or desire outputs that decrease/increase the resolution compared to the input data. However, we show that by employing invertible networks in a non-standard fashion, we can still use them for these tasks. Examples in hyperspectral land-use classification, airborne geophysical surveying, and seismic imaging illustrate that we can input large data volumes in one chunk and do not need to work on small patches, use dimensionality reduction, or employ methods that classify a patch to a single central pixel.</p></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"5 ","pages":"Article 100087"},"PeriodicalIF":0.0000,"publicationDate":"2024-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666544124000285/pdfft?md5=aefb3645cc92ad5ad25d7d3f97a32057&pid=1-s2.0-S2666544124000285-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence in Geosciences","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666544124000285","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The large spatial/temporal/frequency scale of geoscience and remote-sensing datasets causes memory issues when using convolutional neural networks for (sub-) surface data segmentation. Recently developed fully reversible or fully invertible networks can mostly avoid memory limitations by recomputing the states during the backward pass through the network. This results in a low and fixed memory requirement for storing network states, as opposed to the typical linear memory growth with network depth. This work focuses on a fully invertible network based on the telegraph equation. While reversibility saves the major amount of memory used in deep networks by the data, the convolutional kernels can take up most memory if fully invertible networks contain multiple invertible pooling/coarsening layers. We address the explosion of the number of convolutional kernels by combining fully invertible networks with layers that contain the convolutional kernels in a compressed form directly. A second challenge is that invertible networks output a tensor the same size as its input. This property prevents the straightforward application of invertible networks to applications that map between different input–output dimensions, need to map to outputs with more channels than present in the input data, or desire outputs that decrease/increase the resolution compared to the input data. However, we show that by employing invertible networks in a non-standard fashion, we can still use them for these tasks. Examples in hyperspectral land-use classification, airborne geophysical surveying, and seismic imaging illustrate that we can input large data volumes in one chunk and do not need to work on small patches, use dimensionality reduction, or employ methods that classify a patch to a single central pixel.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

用于分割大规模地表和次地表数据的完全可逆双曲神经网络

地球科学和遥感数据集的空间/时间/频率尺度较大，在使用卷积神经网络进行（次）表面数据分割时会产生内存问题。最近开发的完全可逆或完全可逆网络通过在网络后向传递过程中重新计算状态，在很大程度上避免了内存限制。这就使得存储网络状态所需的内存较低且固定，而不是典型的随网络深度线性增长的内存。这项工作的重点是基于电报方程的完全可逆网络。虽然可逆性节省了深度网络中数据所使用的大部分内存，但如果完全可逆网络包含多个可逆池/解析层，卷积核可能会占用大部分内存。我们通过将完全可逆网络与直接包含压缩形式卷积核的层结合起来，解决了卷积核数量激增的问题。第二个挑战是，可逆网络输出的张量与其输入大小相同。这一特性阻碍了可逆网络在以下应用中的直接应用：在不同的输入-输出维度之间进行映射，需要映射到比输入数据具有更多通道的输出，或者希望输出与输入数据相比降低/提高分辨率。不过，我们的研究表明，通过以非标准方式使用可逆网络，我们仍然可以将其用于这些任务。高光谱土地利用分类、机载地球物理勘测和地震成像中的例子说明，我们可以一次性输入大量数据，而无需处理小块数据、使用降维或采用将小块数据分类为单个中心像素的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Artificial Intelligence in Geosciences

CiteScore

4.20

自引率

0.00%

发文量