Automatic image annotation and tagging is necessary for indexing and searching of images using querying a text. It is widely used in search engines like Google, Yahoo, Baidu, etc. Fast image Tagging (FastTag) algorith...
详细信息
Automatic image annotation and tagging is necessary for indexing and searching of images using querying a text. It is widely used in search engines like Google, Yahoo, Baidu, etc. Fast image Tagging (FastTag) algorithm is proposed to accelerate image annotation process, while keeping the precision of automatic image annotation results. Feature mapping is used to map image features vectors onto higher dimensional feature space. Feature mapping methods plays an important role in automatic image annotation. In this paper, we have compared 6 kernels, among which four kernels are used in homogeneous feature mapping and two kernels are used in discriminative tree based feature mapping, to investigate which feature mapping performs better for automatic image annotation. The performance of these methods has been analyzed by conducting intensive experiments on three different datasets as used by FastTag algorithm in their experiments. We have found that the homogeneous feature mapping with χ 2 kernel is more suitable when used in FastTag algorithm in terms of precision, recall, FI score and N+ measures, and with a relatively acceptable performance.
Depth estimation plays an important role in robotic perception systems. The self-supervised monocular paradigm has gained significant attention since it can free training from the reliance on depth annotations. Despit...
详细信息
The images captured in fog conditions have degraded contrast,that makes current imageprocessing applications sensitive and error *** propose in this paper an efficient single image enhancement algorithm suitable for ...
详细信息
The images captured in fog conditions have degraded contrast,that makes current imageprocessing applications sensitive and error *** propose in this paper an efficient single image enhancement algorithm suitable for daytime fog conditions and based on an original mathematical model,for computing the atmospheric veil,that takes into account the variation in fog density to the *** model is inspired by the functions that appear in partition of unity in the differential geometry *** observing images captured in fog conditions,usually the fog has a very low density in front of the camera and this density has a non-linear increase with the distance,such that objects are no longer visible at greater *** using our mathematical model we are able to obtain superior reconstructions of the original fog-free image,when comparing to traditional *** advantage of our method is the ability to adapt the model in accordance to the density of the fog.A quantitative and qualitative evaluation is performed on both synthetic and real camera *** evaluation proves that our mathematical model is more suitable for image enhancement in both homogeneous and heterogeneous fog *** algorithm is able to perform image enhancement in real time for both color and gray scale images.
Artwork Generation is an important research area of computer vision. Recently, kinds of generative models have achieved great success in natural image generation. However, artwork generation has rarely been studied du...
详细信息
Deep Neural Networks (DNNs) are susceptible to elaborately designed perturbations, whether such perturbations are dependent or independent of images. The latter one, called Universal Adversarial Perturbation (UAP), is...
Deep Neural Networks (DNNs) are susceptible to elaborately designed perturbations, whether such perturbations are dependent or independent of images. The latter one, called Universal Adversarial Perturbation (UAP), is very attractive for model robustness analysis, since its independence of input reveals the intrinsic characteristics of the model. Relatively, another interesting observation is Neural Collapse (NC), which means the feature variability may collapse during the terminal phase of training. Motivated by this, we propose to generate UAP by attacking the layer where NC phenomenon happens. Because of NC, the proposed attack could gather all the natural images’ features to its surrounding, which is hence called Feature-Gathering UAP (FG-UAP). We evaluate the effectiveness our proposed algorithm on abundant experiments, including untargeted and targeted universal attacks, attacks under limited dataset, and transfer-based black-box attacks among different architectures including Vision Transformers, which are believed to be more robust. After that, we empirically verify the effectiveness of NC's conclusion on UAP by attacking on only 10% of the dataset while keeping comparable performance. Finally, we investigate FG-UAP in the view of NC by analyzing the labels and extracted features of adversarial examples, finding that collapse phenomenon becomes stronger after the model is corrupted. Codes for the project are available at https://***/yzx1213/FG-UAP.
An important step in the analysis of printed documents is the segmentation and classification of blocks into categories such as photographs, titles, paragraphs, etc. The authors present an approach to enhance and comb...
详细信息
An important step in the analysis of printed documents is the segmentation and classification of blocks into categories such as photographs, titles, paragraphs, etc. The authors present an approach to enhance and combine two commonly used methods, a merging bottom-up approach and a cutting top down approach, to segment pages of a newspaper. The implementation of a layout analysis system as a preprocessing module for a commercial product is described.< >
In this paper,we presented a displacement field estimation algorithm based on a relaxed smoothness constraint;this algorithmcan preserve discontinuities in the displacement field to some *** image data is irregular an...
详细信息
In this paper,we presented a displacement field estimation algorithm based on a relaxed smoothness constraint;this algorithmcan preserve discontinuities in the displacement field to some *** image data is irregular and the images are noisy,the method produces some big residual errors in the residual *** this paper we propose an improved displacement field estimation algorithm which uses the displacement information obtained using blockmatching to modify the matching *** results show,this leads to smaller residual error maps, without introducing block artefacts,as would happen in the case of simple block matching when there is much noise in the *** the displacement filed using this method is more consistent than using a method without additional block matching.
Multi-class segmentation assigns a class label to each pixel in an image. It represents a significant task for the semantic understanding of images and has received plentiful attention over the last years. The current...
详细信息
ISBN:
(纸本)9781479936397
Multi-class segmentation assigns a class label to each pixel in an image. It represents a significant task for the semantic understanding of images and has received plentiful attention over the last years. The current state of art is dominated by conditional random field based approaches, defined over pixels or image segments. However, high accuracy segmentation comes at a high computational cost. The best performing methods can barely run at few frames per second and are far from real-time applications. Our goal is to bridge the gap between current state of the art segmentation approaches and real-time applications. In this paper we propose an efficient approach for individual pixel classification. Multiple local descriptors are computed densely and then quantized using visual codebooks. Joint boosting is used to classify each pixel based on the quantized local descriptors. We show that using careful design choices and GPU optimization we can achieve sate of the art segmentation results at over 50 FPS. We also propose a Conditional Random Field (CRF) model defined over superpixels that uses the proposed pixel classifier for the estimation of unary potentials. The CRF based multi-class segmentation can run at over 30 FPS. The proposed approach is validated on the MSRC21 and CamVid multi-class segmentation benchmarks, the former one consisting of urban traffic sequences.
This paper proposes a novel method for detecting the moving vehicles in dynamic urban traffic scenes using a stereo camera. Relying on the fact that a set of feature points on a rigid 3D scene object are staying in a ...
详细信息
This paper proposes a novel method for detecting the moving vehicles in dynamic urban traffic scenes using a stereo camera. Relying on the fact that a set of feature points on a rigid 3D scene object are staying in a rigid 3D configuration, we propose to compute the relative motion between the camera and a moving object with an algorithm that follows from the visual odometry based motion estimation methods. Subtracting the camera motion we obtain the absolute object motion. Additionally we create a compact representation of the scene using superpixels computed from intensity and depth information. A graph-like structure is built, having superpixels as nodes and indicating neighboring relationships between adjacent superpixels. Objects are segmented using a fast region growing algorithm that considers as seeds the features used to compute the object motion.
This paper presents uncertainty propagation in landmark based position estimation methods. Analysis of two methods has been carried out where robot position is estimated by detecting one or two globally distinct featu...
详细信息
ISBN:
(纸本)9784901122078
This paper presents uncertainty propagation in landmark based position estimation methods. Analysis of two methods has been carried out where robot position is estimated by detecting one or two globally distinct features using a pivoted stereo vision system. We make a basic assumption about error in estimating point features in camera images and propagate it into robot position estimate using first order approximation of non-linear functions. Simulation results illustrate the performance of the method.
暂无评论