Tom Crossland, Pontus Stenetorp, Daisuke Kawata, Sebastian Riedel, Thomas D. Kitching, Anurag Deshpande, Tom Kimpson, Choong Ling Liew-Cain, Christian Pedersen, Davide Piras, Monu Sharma
{"title":"Toward Machine-learning-based Metastudies: Applications to Cosmological Parameters","authors":"Tom Crossland, Pontus Stenetorp, Daisuke Kawata, Sebastian Riedel, Thomas D. Kitching, Anurag Deshpande, Tom Kimpson, Choong Ling Liew-Cain, Christian Pedersen, Davide Piras, Monu Sharma","doi":"10.3847/1538-4365/acf76a","DOIUrl":null,"url":null,"abstract":"Abstract We develop a new model for automatic extraction of reported measurement values from the astrophysical literature, utilizing modern natural language processing techniques. We use this model to extract measurements present in the abstracts of the approximately 248,000 astrophysics articles from the arXiv repository, yielding a database containing over 231,000 astrophysical numerical measurements. Furthermore, we present an online interface ( Numerical Atlas ) to allow users to query and explore this database, based on parameter names and symbolic representations, and download the resulting data sets for their own research uses. To illustrate potential use cases, we then collect values for nine different cosmological parameters using this tool. From these results, we can clearly observe the historical trends in the reported values of these quantities over the past two decades and see the impacts of landmark publications on our understanding of cosmology.","PeriodicalId":8588,"journal":{"name":"Astrophysical Journal Supplement Series","volume":"50 5","pages":"0"},"PeriodicalIF":8.6000,"publicationDate":"2023-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Astrophysical Journal Supplement Series","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3847/1538-4365/acf76a","RegionNum":1,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ASTRONOMY & ASTROPHYSICS","Score":null,"Total":0}
引用次数: 0
Abstract
Abstract We develop a new model for automatic extraction of reported measurement values from the astrophysical literature, utilizing modern natural language processing techniques. We use this model to extract measurements present in the abstracts of the approximately 248,000 astrophysics articles from the arXiv repository, yielding a database containing over 231,000 astrophysical numerical measurements. Furthermore, we present an online interface ( Numerical Atlas ) to allow users to query and explore this database, based on parameter names and symbolic representations, and download the resulting data sets for their own research uses. To illustrate potential use cases, we then collect values for nine different cosmological parameters using this tool. From these results, we can clearly observe the historical trends in the reported values of these quantities over the past two decades and see the impacts of landmark publications on our understanding of cosmology.
期刊介绍:
The Astrophysical Journal Supplement (ApJS) serves as an open-access journal that publishes significant articles featuring extensive data or calculations in the field of astrophysics. It also facilitates Special Issues, presenting thematically related papers simultaneously in a single volume.