版权所有:内蒙古大学图书馆 技术提供:维普资讯• 智图
内蒙古自治区呼和浩特市赛罕区大学西街235号 邮编: 010021
作者机构:School of Artificial Intelligence and Data Science University of Science and Technology of China Hefei230026 China CAS Key Laboratory for Research in Galaxies and Cosmology Department of Astronomy University of Science and Technology of China Hefei230026 China School of Astronomy and Space Science University of Science and Technology of China Hefei230026 China Deep Space Exploration Laboratory Hefei230088 China McWilliams Center for Cosmology Department of Physics Carnegie Mellon University 5000 Forbes Ave PittsburghPA15213 United States Shanghai AI Laboratory Shanghai200232 China Purple Mountain Observatory Nanjing210023 China
出 版 物:《arXiv》 (arXiv)
年 卷 期:2024年
核心收录:
摘 要:Context. The mounting data stream of large time-domain surveys renders the visual inspections of a huge set of transient candidates impractical. Techniques based on deep learning-based are popular solutions for minimizing human intervention in the time domain community. The classification of real and bogus transients is a fundamental component in real-time data processing systems and is critical to enabling rapid follow-up observations. Most existing methods (supervised learning) require sufficiently large training samples with corresponding labels, which involve costly human labeling and are challenging in the early stages of a time-domain survey. One method that can make use of training samples with access to only a limited amount of labels is highly desirable for future large time-domain surveys. These include the forthcoming 2.5-meter Wide-Field Survey Telescope (WFST) six-year survey and the Vera C. Rubin Observatory Legacy Survey of Space and Time (LSST). Aims. Deep-learning-based methods have been favored in astrophysics owing to their adaptability and remarkable performance. They have been applied to the task of the classification of real and bogus transients. Unlike most existing approaches, which necessitate massive and expensive annotated data, we aim to leverage training samples with only 1000 labels and discover real sources that vary in brightness over time in the early stages of the WFST six-year survey. Methods. We present a novel deep learning method that combines active learning and semi-supervised learning to construct a competitive real-bogus classifier. Our method incorporates an active learning stage, where we actively select the most informative or uncertain samples for annotation. This stage aims to achieve higher model performance by leveraging fewer labeled samples, thus reducing annotation costs and improving the overall learning process efficiency. Furthermore, our approach involves a semi-supervised learning stage that exploits the unlab