{"title":"Argo data anomaly detection based on transformer and Fourier transform","authors":"Longkai Sui, Yongguo Jiang","doi":"10.1016/j.seares.2024.102483","DOIUrl":null,"url":null,"abstract":"<div><p>Argo data is multidimensional observational data from ocean floats, which has long been plagued by data anomalies. Currently, the anomaly detection problem in Argo data still faces many challenges, as anomalies may involve complex relationships between multiple variables, leading to suboptimal performance with traditional machine learning methods. Thanks to the advancement of deep learning, it has become the predominant methodology for anomaly detection, demonstrating notable performance. The Transformer model has shown significant potential in the field of data anomaly detection, with its core Self-Attention mechanism capable of learning relationships between variables. We introduce Fast Fourier Transform (FFT) into the Transformer model, enabling the model to better capture periodic patterns and complex relationships in multivariate data, learning normal data patterns to improve the method for Argo data anomaly detection. Through experiments conducted on three public datasets and the Argo dataset, the enhanced model outperforms the original model in terms of performance. This also demonstrates the potential of FFT in multidimensional data anomaly detection, providing new insights into addressing anomaly detection challenges in real-world complex datasets.</p></div>","PeriodicalId":50056,"journal":{"name":"Journal of Sea Research","volume":"198 ","pages":"Article 102483"},"PeriodicalIF":2.1000,"publicationDate":"2024-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1385110124000169/pdfft?md5=10dfe2080220941d1d1072a73d8fa08a&pid=1-s2.0-S1385110124000169-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Sea Research","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1385110124000169","RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MARINE & FRESHWATER BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Argo data is multidimensional observational data from ocean floats, which has long been plagued by data anomalies. Currently, the anomaly detection problem in Argo data still faces many challenges, as anomalies may involve complex relationships between multiple variables, leading to suboptimal performance with traditional machine learning methods. Thanks to the advancement of deep learning, it has become the predominant methodology for anomaly detection, demonstrating notable performance. The Transformer model has shown significant potential in the field of data anomaly detection, with its core Self-Attention mechanism capable of learning relationships between variables. We introduce Fast Fourier Transform (FFT) into the Transformer model, enabling the model to better capture periodic patterns and complex relationships in multivariate data, learning normal data patterns to improve the method for Argo data anomaly detection. Through experiments conducted on three public datasets and the Argo dataset, the enhanced model outperforms the original model in terms of performance. This also demonstrates the potential of FFT in multidimensional data anomaly detection, providing new insights into addressing anomaly detection challenges in real-world complex datasets.
期刊介绍:
The Journal of Sea Research is an international and multidisciplinary periodical on marine research, with an emphasis on the functioning of marine ecosystems in coastal and shelf seas, including intertidal, estuarine and brackish environments. As several subdisciplines add to this aim, manuscripts are welcome from the fields of marine biology, marine chemistry, marine sedimentology and physical oceanography, provided they add to the understanding of ecosystem processes.