The entropy of galaxy spectra: How much information is encoded?

I. Ferreras, O. Lahav, R. Somerville, J. Silk
{"title":"The entropy of galaxy spectra: How much information is encoded?","authors":"I. Ferreras, O. Lahav, R. Somerville, J. Silk","doi":"10.1093/rasti/rzad004","DOIUrl":null,"url":null,"abstract":"\n The inverse problem of extracting the stellar population content of galaxy spectra is analysed here from a basic standpoint based on information theory. By interpreting spectra as probability distribution functions, we find that galaxy spectra have high entropy, thus leading to a rather low effective information content. The highest variation in entropy is unsurprisingly found in regions that have been well studied for decades with the conventional approach. We target a set of six spectral regions that show the highest variation in entropy – the 4000 Å break being the most informative one. As a test case with real data, we measure the entropy of a set of high quality spectra from the Sloan Digital Sky Survey, and contrast entropy-based results with the traditional method based on line strengths. The data are classified into star-forming (SF), quiescent (Q) and AGN galaxies, and show – independently of any physical model – that AGN spectra can be interpreted as a transition between SF and Q galaxies, with SF galaxies featuring a more diverse variation in entropy. The high level of entanglement complicates the determination of population parameters in a robust, unbiased way, and affect traditional methods that compare models with observations, as well as machine learning (especially deep learning) algorithms that rely on the statistical properties of the data to assess the variations among spectra. Entropy provides a new avenue to improve population synthesis models so that they give a more faithful representation of real galaxy spectra.","PeriodicalId":367327,"journal":{"name":"RAS Techniques and Instruments","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"RAS Techniques and Instruments","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/rasti/rzad004","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

The inverse problem of extracting the stellar population content of galaxy spectra is analysed here from a basic standpoint based on information theory. By interpreting spectra as probability distribution functions, we find that galaxy spectra have high entropy, thus leading to a rather low effective information content. The highest variation in entropy is unsurprisingly found in regions that have been well studied for decades with the conventional approach. We target a set of six spectral regions that show the highest variation in entropy – the 4000 Å break being the most informative one. As a test case with real data, we measure the entropy of a set of high quality spectra from the Sloan Digital Sky Survey, and contrast entropy-based results with the traditional method based on line strengths. The data are classified into star-forming (SF), quiescent (Q) and AGN galaxies, and show – independently of any physical model – that AGN spectra can be interpreted as a transition between SF and Q galaxies, with SF galaxies featuring a more diverse variation in entropy. The high level of entanglement complicates the determination of population parameters in a robust, unbiased way, and affect traditional methods that compare models with observations, as well as machine learning (especially deep learning) algorithms that rely on the statistical properties of the data to assess the variations among spectra. Entropy provides a new avenue to improve population synthesis models so that they give a more faithful representation of real galaxy spectra.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
星系光谱的熵:编码了多少信息?
本文从信息论的基本观点出发,分析了提取星系光谱中恒星族含量的反问题。通过将光谱解释为概率分布函数,我们发现星系光谱具有高熵,从而导致有效信息含量相当低。毫无疑问,熵的最大变化出现在那些用传统方法研究了几十年的地区。我们的目标是一组显示熵变化最大的六个光谱区域- 4000 Å断裂是信息量最大的一个。作为实际数据的测试用例,我们测量了来自斯隆数字巡天的一组高质量光谱的熵,并将基于熵的结果与基于线强度的传统方法进行了对比。这些数据被分为恒星形成星系(SF)、静止星系(Q)和AGN星系,并显示——独立于任何物理模型——AGN光谱可以被解释为SF和Q星系之间的过渡,SF星系具有更多样化的熵变化。高水平的纠缠使得以稳健、无偏的方式确定总体参数变得复杂,并影响了将模型与观测值进行比较的传统方法,以及依赖数据统计特性来评估光谱变化的机器学习(尤其是深度学习)算法。熵为改进种群综合模型提供了一条新的途径,使它们能够更忠实地表示真实的星系光谱。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Classifying LEO satellite platforms with boosted decision trees PyExoCross: a Python program for generating spectra and cross sections from molecular line lists The verification of periodicity with the use of recurrent neural networks REPUBLIC: A variability-preserving systematic-correction algorithm for PLATO’s multi-camera light curves A simple spacecraft – vector intersection methodology and applications
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1