Predicting Pulsars from Imbalanced Dataset with Hybrid Resampling Approach

IF 1.6 4区 物理与天体物理 Q3 ASTRONOMY & ASTROPHYSICS Advances in Astronomy Pub Date : 2021-12-03 DOI:10.1155/2021/4916494
Ernesto Lee, F. Rustam, Wajdi Aljedaani, Abid Ishaq, Vaibhav Rupapara, I. Ashraf
{"title":"Predicting Pulsars from Imbalanced Dataset with Hybrid Resampling Approach","authors":"Ernesto Lee, F. Rustam, Wajdi Aljedaani, Abid Ishaq, Vaibhav Rupapara, I. Ashraf","doi":"10.1155/2021/4916494","DOIUrl":null,"url":null,"abstract":"Pulsar stars, usually neutron stars, are spherical and compact objects containing a large quantity of mass. Each pulsar star possesses a magnetic field and emits a slightly different pattern of electromagnetic radiation which is used to identify the potential candidates for a real pulsar star. Pulsar stars are considered an important cosmic phenomenon, and scientists use them to study nuclear physics, gravitational waves, and collisions between black holes. Defining the process of automatic detection of pulsar stars can accelerate the study of pulsar stars by scientists. This study contrives an accurate and efficient approach for true pulsar detection using supervised machine learning. For experiments, the high time-resolution (HTRU2) dataset is used in this study. To resolve the data imbalance problem and overcome model overfitting, a hybrid resampling approach is presented in this study. Experiments are performed with imbalanced and balanced datasets using well-known machine learning algorithms. Results demonstrate that the proposed hybrid resampling approach proves highly influential to avoid model overfitting and increase the prediction accuracy. With the proposed hybrid resampling approach, the extra tree classifier achieves a 0.993 accuracy score for true pulsar star prediction.","PeriodicalId":48962,"journal":{"name":"Advances in Astronomy","volume":" ","pages":""},"PeriodicalIF":1.6000,"publicationDate":"2021-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances in Astronomy","FirstCategoryId":"101","ListUrlMain":"https://doi.org/10.1155/2021/4916494","RegionNum":4,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ASTRONOMY & ASTROPHYSICS","Score":null,"Total":0}
引用次数: 5

Abstract

Pulsar stars, usually neutron stars, are spherical and compact objects containing a large quantity of mass. Each pulsar star possesses a magnetic field and emits a slightly different pattern of electromagnetic radiation which is used to identify the potential candidates for a real pulsar star. Pulsar stars are considered an important cosmic phenomenon, and scientists use them to study nuclear physics, gravitational waves, and collisions between black holes. Defining the process of automatic detection of pulsar stars can accelerate the study of pulsar stars by scientists. This study contrives an accurate and efficient approach for true pulsar detection using supervised machine learning. For experiments, the high time-resolution (HTRU2) dataset is used in this study. To resolve the data imbalance problem and overcome model overfitting, a hybrid resampling approach is presented in this study. Experiments are performed with imbalanced and balanced datasets using well-known machine learning algorithms. Results demonstrate that the proposed hybrid resampling approach proves highly influential to avoid model overfitting and increase the prediction accuracy. With the proposed hybrid resampling approach, the extra tree classifier achieves a 0.993 accuracy score for true pulsar star prediction.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用混合重采样方法预测不平衡数据集中的脉冲星
脉冲星,通常是中子星,是球状的致密天体,含有大量的质量。每颗脉冲星都有一个磁场,并发射出一种略有不同的电磁辐射模式,这种模式被用来识别真正脉冲星的潜在候选者。脉冲星被认为是一种重要的宇宙现象,科学家们利用它们来研究核物理学、引力波和黑洞之间的碰撞。确定脉冲星自动探测的过程,可以加快科学家对脉冲星的研究。本研究设计了一种使用监督机器学习进行真正脉冲星探测的准确有效的方法。实验采用高时间分辨率(HTRU2)数据集。为了解决数据不平衡问题和克服模型过拟合问题,本文提出了一种混合重采样方法。使用著名的机器学习算法对不平衡和平衡数据集进行实验。结果表明,混合重采样方法对避免模型过拟合和提高预测精度有很大的影响。采用混合重采样方法,额外树分类器对真实脉冲星的预测准确率达到0.993。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Advances in Astronomy
Advances in Astronomy ASTRONOMY & ASTROPHYSICS-
CiteScore
2.70
自引率
7.10%
发文量
10
审稿时长
22 weeks
期刊介绍: Advances in Astronomy publishes articles in all areas of astronomy, astrophysics, and cosmology. The journal accepts both observational and theoretical investigations into celestial objects and the wider universe, as well as the reports of new methods and instrumentation for their study.
期刊最新文献
A Study of the Early Cosmic Dynamics in a Multifield Model of Inflation and Curvature Perturbations Forecasting Ionospheric TEC Changes Associated with the December 2019 and June 2020 Solar Eclipses: A Comparative Analysis of OKSM, FFNN, and DeepAR Models Measuring Track-Related Pointing Errors on the Nanshan Radio Telescope with an Optical Pointing Telescope Tracking and Disturbance Suppression of the Radio Telescope Servo System Based on the Equivalent-Input-Disturbance Approach Dark Energy from Cosmological Energy Conservation
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1