{"title":"Paying attention to astronomical transients: Introducing the time-series transformer for photometric classification","authors":"Tarek Allam, Jason D McEwen","doi":"10.1093/rasti/rzad046","DOIUrl":null,"url":null,"abstract":"Future surveys such as the Legacy Survey of Space and Time (LSST) of the Vera C. Rubin Observatory will observe an order of magnitude more astrophysical transient events than any previous survey before. With this deluge of photometric data, it will be impossible for all such events to be classified by humans alone. Recent efforts have sought to leverage machine learning methods to tackle the challenge of astronomical transient classification, with ever improving success. Transformers are a recently developed deep learning architecture, first proposed for natural language processing, that have shown a great deal of recent success. In this work we develop a new transformer architecture, which uses multi-head self attention at its core, for general multi-variate time-series data. Furthermore, the proposed time-series transformer architecture supports the inclusion of an arbitrary number of additional features, while also offering interpretability. We apply the time-series transformer to the task of photometric classification, minimising the reliance of expert domain knowledge for feature selection, while achieving results comparable to state-of-the-art photometric classification methods. We achieve a logarithmic-loss of 0.507 on imbalanced data in a representative setting using data from the Photometric LSST Astronomical Time-Series Classification Challenge (PLAsTiCC). Moreover, we achieve a micro-averaged receiver operating characteristic area under curve of 0.98 and micro-averaged precision-recall area under curve of 0.87.","PeriodicalId":500957,"journal":{"name":"RAS Techniques and Instruments","volume":"85 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"RAS Techniques and Instruments","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/rasti/rzad046","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Future surveys such as the Legacy Survey of Space and Time (LSST) of the Vera C. Rubin Observatory will observe an order of magnitude more astrophysical transient events than any previous survey before. With this deluge of photometric data, it will be impossible for all such events to be classified by humans alone. Recent efforts have sought to leverage machine learning methods to tackle the challenge of astronomical transient classification, with ever improving success. Transformers are a recently developed deep learning architecture, first proposed for natural language processing, that have shown a great deal of recent success. In this work we develop a new transformer architecture, which uses multi-head self attention at its core, for general multi-variate time-series data. Furthermore, the proposed time-series transformer architecture supports the inclusion of an arbitrary number of additional features, while also offering interpretability. We apply the time-series transformer to the task of photometric classification, minimising the reliance of expert domain knowledge for feature selection, while achieving results comparable to state-of-the-art photometric classification methods. We achieve a logarithmic-loss of 0.507 on imbalanced data in a representative setting using data from the Photometric LSST Astronomical Time-Series Classification Challenge (PLAsTiCC). Moreover, we achieve a micro-averaged receiver operating characteristic area under curve of 0.98 and micro-averaged precision-recall area under curve of 0.87.
未来的调查,如Vera C. Rubin天文台的遗产时空调查(LSST),将比以往任何调查都能观测到更多的天体物理瞬变事件。有了这些海量的光度数据,单靠人类对所有这些事件进行分类是不可能的。最近的努力试图利用机器学习方法来解决天文瞬态分类的挑战,并取得了越来越大的成功。变形金刚是最近开发的一种深度学习架构,最初是为自然语言处理提出的,最近取得了很大的成功。在这项工作中,我们开发了一种新的变压器架构,它以多头自关注为核心,用于一般的多变量时间序列数据。此外,所建议的时间序列转换器体系结构支持包含任意数量的附加特性,同时还提供可解释性。我们将时间序列转换器应用于光度分类任务,最大限度地减少了特征选择对专家领域知识的依赖,同时实现了与最先进的光度分类方法相当的结果。我们使用来自Photometric LSST天文时间序列分类挑战(PLAsTiCC)的数据,在代表性设置中对不平衡数据实现了0.507的对数损失。此外,我们还实现了微平均接收机工作特征曲线下面积为0.98,微平均精确召回面积为0.87。