Document classification is a relevant task within every intelligent document processing system. With the advances in deep learning andcomputervision techniques, this task has become a painless and straightforward pr...
详细信息
At present, the research of image classification mainly focuses on the detection and classification of objects. However, most of the current classification methods still rely on a large number of pixel level manual an...
详细信息
ISBN:
(纸本)9781728199481
At present, the research of image classification mainly focuses on the detection and classification of objects. However, most of the current classification methods still rely on a large number of pixel level manual annotation samples, which often has a greater impact on practical problems. This paper will focus on the task of image classification in the weakly supervised mode. By combining attention mechanism with convolution neural network, the model can extract salient regions in the image circularly, and then use the multi branch VGG deep neural network model to adjust the classification of the extracted features, so that the model can focus on the features of the salient region, so as to improve the ability of this model. The experimental results show that the improved weak supervised learning model can achieve 87.2% and 85.7% classification accuracy under the premise of reducing the use of artificial features, which means that it can achieve better image classification effect when compared with other advanced image classification methods.
Face information is important information for identifying people, but at low resolution, face information is not well recognized. A psychological physics experiment of person recognition in daily life was designed to ...
详细信息
In this paper, a new approach for the detection and classification of potato plant disease is implemented using computervision techniques. Most of the existing algorithms based on plant disease detection and classifi...
详细信息
The proceedings contain 83 papers. The special focus in this conference is on Machine Learning, imageprocessing, Network Security and Data Sciences. The topics include: Distance Invariant RGB-D Object Recognition Usi...
ISBN:
(纸本)9789811563140
The proceedings contain 83 papers. The special focus in this conference is on Machine Learning, imageprocessing, Network Security and Data Sciences. The topics include: Distance Invariant RGB-D Object Recognition Using DSMS System;Single Look SAR image Segmentation Using Local Entropy, Median Low-Pass Filter and Fuzzy Inference System;An Edge Detection and Sliding Window Based Approach to Improve Object Localization in YOLOv3;automatic Detection of Pneumonia from Chest X-Rays Using Deep Learning;an Improved Accuracy Rate in Microaneurysms Detection in Retinal Fundus images Using Non-local Mean Filter;emotion Recognition from Periocular Features;a Comparative Study of Computational Intelligence for Identification of Breast Cancer;segmentation of Blood Vessels from Fundus image Using Scaled Grid;head Pose Estimation of Face: Angle of Roll, Yaw, and Pitch of the Face image;identifying Fake Profile in Online Social Network: An Overview and Survey;enhancing and Classifying Traffic Signs Using computervision and Deep Convolutional Neural Network;diabetic Retinopathy Detection on Retinal Fundus images Using Convolutional Neural Network;medical image Fusion Based on Deep Decomposition and Sparse Representation;rice Plant Disease Detection and Classification Using Deep Residual Learning;Flood Detection Using Multispectral images and SAR Data;Handwritten Character Recognition Using KNN and SVM Based Classifier over Feature Vector from Autoencoder;a ConvNet Based Procedure for image Copy-Move Forgery Detection;a Novel Invisible Watermarking Approach for Double Authentication of image Content;remote Sensing Signature Classification of Agriculture Detection Using Deep Convolution Network Models;buddhist Hasta Mudra Recognition Using Morphological Features.
The emergence of deep learning in the field of computervision has led to extensive deployment of convolutional neural networks (CNNs) in visual recognition systems for feature extraction. CNNs provide learning throug...
详细信息
ISBN:
(纸本)9783030294076;9783030294069
The emergence of deep learning in the field of computervision has led to extensive deployment of convolutional neural networks (CNNs) in visual recognition systems for feature extraction. CNNs provide learning through hierarchical inferencing by providing multilayer architecture. Due to high processing capability of CNNs in multidimensional signals like images, they are considered to be predominant artificial neural networks. CNNs are extensively used in computervision such as in image recognition where the intent is to automatically learn features followed by generalization and eventually recognizing the learned features. In this paper, we investigate the efficiency of CNNs: AlexNet and GoogLeNet under the effect of blurring which occurs frequently during image capturing process. Here, Gaussian blurring is used since it minimizes the noise embedded into the image. For experimentation, UC Merced Land Use aerial dataset is used to evaluate CNNs' performance. The focus is to train these CNNs and classifying an extensive range of classes accurately under the influence of Gaussian blurring. Accuracy and loss are the parameters of classification considered for evaluating the performance of CNNs. Experimental results validated the susceptibility of CNNs towards blurring effect with GoogLeNet being more fluctuating to varied degrees of Gaussian blurring than AlexNet.
Aiming at the situation that QR code identification and RFID identification cannot effectively used in aluminum profile warehouse management, it is proposed to use imageprocessing technology to compare and identify a...
详细信息
Classification of algae is one of challenging task due to its micro-size and similarity in shapes. Therefore, the manual taxonomic classification of algae is very difficult and needs high-level expertise. In this case...
详细信息
Retinal Vessel segmentation is an indispensable part of the task of the automatic detection of retinopathy through fundus images, while there are several challenges, such as lots of noise, low distinction between bloo...
详细信息
ISBN:
(纸本)9781728199481
Retinal Vessel segmentation is an indispensable part of the task of the automatic detection of retinopathy through fundus images, while there are several challenges, such as lots of noise, low distinction between blood vessels and environment, and uneven distribution of thick and thin blood vessels. Deep learning-based methods represented by U-Net performs very well on the task of retinal vessel segmentation. As the attention mechanism has made breakthroughs in many computervision tasks, it has attracted the attention from the researcher. This paper proposed a kind of U-Net network based on triple attention mechanism-3AU-Net to overcome the problems of retinal vessel segmentation. We follow the framework of U-Net 's full convolution and skip connection, integrating spatial attention mechanism with channel attention mechanism and context attention mechanism. Spatial attention allows the segmentation network to find the blood vessel region that needs attention, thereby suppressing noise. Channel attention can make the expression of features more diverse and highlight the feature channels with key information. The context attention can integrate the context information to make the network to focus on the key pixels. Experimental consequences have indicated that 3AU-Net can greatly improve the results of the segmentation of retinal blood vessels, and this method surpasses other deep learning-based methods in many indicators on the DRIVE and STARE fundus image data sets. On the DRIVE data set, the 3A-UNet model achieved excellent performance on multiple evaluation indicators, with an ACC score of 0.9592, an AUC score of 0.9770, and a sensitivity score of 0.8537.
Visual Question Answering is a perfect mix of issues enveloping different spaces including Natural Language processing, computervision and knowledge portrayal. The problem involves giving an image and a natural langu...
详细信息
暂无评论