咨询与建议

看过本文的还看了

相关文献

该作者的其他文献

文献详情 >Unsupervised Object Detection ... 收藏

Unsupervised Object Detection using Patch Based Image Classifier and Gradient Importance Map

作     者:Jain, Vanita Pillai, Manu S. Jain, Achin Dubey, Arun Kumar 

作者机构:Department of Electronic Science University of Delhi Delhi India Center for Research in Computer Vision University of Central Florida Orlando United States Bharati Vidyapeeth’s College of Engineering New Delhi India 

出 版 物:《International Journal of Information Technology (Singapore)》 (Int. J. Inf. Technol.)

年 卷 期:2025年第17卷第4期

页      面:2407-2416页

主  题:Deep learning Grad-CAM Image classification Object detection 

摘      要:Image classification in computer vision has seen tremendous amount of success in recent years. Deep learning has played a pivotal role in achieving human level performance in many image recognition challenges and benchmarks. Even though, image classification has been so successful, no other closely related domains have taken advantage from the efforts put into development of image classification methods. One such closely related field is of Object Detection. Object detection or localisation is a computer vision problem whose solutions have not been victorious enough to human level performance. Many challenges arise when developing object detection models for newly generated domains, one of which is labelling of datasets. Preparation of dataset is one of the most cumbersome and expensive task to accomplish while developing an object detection model. Although, image classifiers are used as a feature extractor in object detection training regimes, their localisation abilities are barely studied. In this paper, we propose an object detection training regime, that does not rely on bounding box labelled datasets, hence unsupervised in nature, and is solely based on trained image classifiers. We build up on our hypothesis, that, if an image classifier is able to predict what object is in the input image, then it must have information about where the object is, we just need a mechanism to extract that information from it. Precisely, we divide the input image into patches of same size and employ a parameter restricted convolutional classifier on each patches to predict whether it contains the object or not, we call this our patch-based image classifier (the object here is the prediction of the trained image classifier). The training of the patch-based classifier is not straightforward as there is no true labels for each patches on which we can reduce the binary cross-entropy. Therefore, we propose a loss function weighted by the importance map, which we generate using Grad

读者评论 与其他读者分享你的观点

用户名:未登录
我的评分