版权所有:内蒙古大学图书馆 技术提供:维普资讯• 智图
内蒙古自治区呼和浩特市赛罕区大学西街235号 邮编: 010021
作者机构:Key Laboratory of Symbolic Computation and Knowledge Engineering College of Computer Science and Technology Jilin University Qianjin Street Jilin Changchun130012 China Department of Biomedical Informatics Ohio State University Lane Ave ColumbusOH43210 United States TUM School of Medicine Technical University of Munich Ismaninger Straße 22 Bavaria Munich81675 Germany Helmholtz Center Munich German Research Center for Environmental Health Ingolstadter Landstraße 1 Bavaria Neuherberg85764 Germany Department of Epidemiology and Biostatistics School of public health Jilin University Qianjin Street Jilin Changchun130012 China
出 版 物:《SSRN》
年 卷 期:2022年
核心收录:
摘 要:For many classification tasks, only experimentally validated positive samples are available, and experimentally validated negative samples are not recorded. The lack of negative samples poses a great challenge for using supervised machine learning. To address this problem, we propose a novel deep reinforcement learning based model to screen reliable negative samples from unlabeled samples, named SURE. SURE has two modules: sample selector and sample inspector. The sample selector screens reliable negative samples from unlabeled samples by two reinforcement strategies. The sample inspector classifies samples and provides rewards to the sample selector. In this paper, we focus on one popular issue in the field of bioinformatics: the ncRNA-protein interaction (NPI) prediction task, which lacks reliable negative samples. Thirty datasets for NPI prediction are used to test the screening effect of SURE. The Experimental results show that our model has a robust negative sample screening capability and is superior to all outstanding sample screening methods used in the NPI prediction task. In addition, we refine 5 NPI datasets containing reliable negative samples screened by SURE, and a webserver (***/sure) is available offering the NPI prediction refined by SURE. © 2022, The Authors. All rights reserved.