Developing image-processingalgorithms based on machine learning is a challenging problem concerning the huge amount of thoroughly annotated data needed. The internet provides many already tagged images for basic clas...
详细信息
ISBN:
(纸本)9781509048052
Developing image-processingalgorithms based on machine learning is a challenging problem concerning the huge amount of thoroughly annotated data needed. The internet provides many already tagged images for basic classification problems like vegetables or different cars, but not for more narrow problems. In order to extend and evaluate the previously presented parking guidance system from our previous work, in this paper, we propose a simulation system based on Unreal Engine 4. We developed an artificial camera which implements all features of a real camera, e.g., lens distortion, motion blur etc. to export video data from the simulated environment. This data is then compared to real-world video footage by using our classification module that distinguishes occupied and free parking lots. We reached a classification rate between 92.28% and 99.72% depending on the parking rows' distance using DoG-features and a support vector machine.
This paper presents the design and implementation of dedicated hardware IP modules for background subtraction, which are suitable to be implemented in embedded vision systems and are efficient in terms of performance,...
详细信息
This paper presents the design and implementation of dedicated hardware IP modules for background subtraction, which are suitable to be implemented in embedded vision systems and are efficient in terms of performance, resource consumption, and operational speed. To achieve this goal, a comprehensive experimental study of different algorithms has been carried out by evaluating a wide range of quality parameters. From the results of this analysis, five candidate algorithms were selected and implemented using a model-based design methodology supported by Matlab and Xilinx FPGA tools. Using only the internal block memory available in the FPGA, they provide adequate solutions for processing low-resolution images with CIF and QCIF formats.
image sets and videos can be modeled as subspaces which are actually points on Grassmann manifolds. Clustering of such visual data lying on Grassmann manifolds is a hard issue based on the fact that the state-of-the-a...
详细信息
ISBN:
(纸本)9781538604915
image sets and videos can be modeled as subspaces which are actually points on Grassmann manifolds. Clustering of such visual data lying on Grassmann manifolds is a hard issue based on the fact that the state-of-the-art methods are only applied to vector space instead of non-Euclidean geometry. In this paper, we propose a novel algorithm termed as kernel sparse subspace clustering on the Grassmann manifold (GKSSC) which embeds the Grassmann manifold into a Reproducing Kernel Hilbert Space (RKHS) by an appropriate Gaussian projection kernel. This kernel is applied to obtain kernel sparse representations of data on Grassmann manifolds utilizing the self-expressive property and exploiting the intrinsic Riemannian geometry within data. Although the Grassmann manifold is compact, the geodesic distances between Grassmann points are well measured by kernel sparse representations based on linear reconstruction. With the kernel sparse representations, experimental results of clustering accuracy on the prevalent public dataset outperform state-of-the-art algorithms by more than 90 percent and the robustness of our algorithm is demonstrated as well.
The translation of the hit-or-miss transform (HMT) for grey-level images from the HMT for binary images is not simple. Initially established as a powerful tool for morphological binary imageprocessing, some generaliz...
详细信息
The translation of the hit-or-miss transform (HMT) for grey-level images from the HMT for binary images is not simple. Initially established as a powerful tool for morphological binary imageprocessing, some generalizations for grey-level images have been proposed in the literature. The difficulty lies in the definition of the complement of a given grey-level image. In this paper, after performing a detailed review of the different approaches proposed in the literature, including those based on fuzzy logic, we propose the definition of the hit-or-miss transform for grey-level images in the framework of the fuzzy mathematical morphology based on fuzzy conjunctions and implications under the duality with respect to a fuzzy negation approach. Some theoretical properties of this operator are studied when a general fuzzy conjunction is considered. In particular, we prove that this fuzzy morphological HMT reduces to the binary one when it is applied to binary images. After that, we focus on the fuzzy morphological HMT derived from t-norms and we introduce the concept of "part of an image" which will guide the study of the desirable properties of our operator and the interpretability of the obtained results. Some preliminary experimental results provide evidence of the potential of this tool to be feasible in design algorithms for detecting patterns. Moreover, some comparisons with other hit-or-miss transforms proposed in the literature are performed. (C) 2015 Elsevier B.v. All rights reserved.
In this paper, we proposed new framework for human action representation, which leverages the strengths of convolutional neural networks (CNNs) and the linear dynamical system (LDS) to represent both spatial and tempo...
详细信息
ISBN:
(纸本)9781509041183
In this paper, we proposed new framework for human action representation, which leverages the strengths of convolutional neural networks (CNNs) and the linear dynamical system (LDS) to represent both spatial and temporal structures of actions in videos. We make two principal contributions: first, we incorporate image-trained CNNs to detect action clip concepts, which takes advantage of different levels of information by combining the two layers in CNNs trained from images;Second, we further propose adopting a linear dynamical system (LDS) to model the relationships between these clip concepts, which captures temporal structures of actions. We have applied the proposed method on two challenging realistic benchmark datasets, and our method achieves high performance up to 86.16% on the YouTube and 82.76% UCF50 datasets, which largely outperforms most of the state-of-the-art algorithms with more sophisticated techniques.
Symmetric non-negative matrix factorization (SymNMF) has important applications in data analytics problems such as document clustering, community detection and image segmentation. In this paper, we propose a novel non...
详细信息
ISBN:
(纸本)9781509041183
Symmetric non-negative matrix factorization (SymNMF) has important applications in data analytics problems such as document clustering, community detection and image segmentation. In this paper, we propose a novel nonconvex variable splitting method for solving SymNMF. Different from the existing works, we prove that the algorithm converges to the set of Karush-Kuhn-Tucker (KKT) points of the nonconvex SymNMF problem with a global sublinear convergence rate. We also show that the algorithm can be efficiently implemented in a distributed manner. Further, we provide sufficient conditions that guarantee the global and local optimality of the obtained solutions. Extensive numerical results performed on both synthetic and real data sets suggest that the proposed algorithm yields high quality of the solutions and converges quickly to the set of local minimum solutions compared with other algorithms.
Nowadays there are many computer vision algorithms dedicated to solve the problem of object detection, from many different perspectives. Many of these algorithms take a considerable processing time even for low resolu...
详细信息
Lossy image compression algorithms are pervasively used to reduce the size of images transmitted over the web and recorded on data storage media. However, we pay for their high compression rate with visual artifacts d...
详细信息
ISBN:
(纸本)9781509061839
Lossy image compression algorithms are pervasively used to reduce the size of images transmitted over the web and recorded on data storage media. However, we pay for their high compression rate with visual artifacts degrading the user experience. Deep convolutional neural networks have become a widespread tool to address high-level computer vision tasks very successfully. Recently, they have found their way into the areas of low-level computer vision and imageprocessing to solve regression problems mostly with relatively shallow networks. We present a novel 12-layer deep convolutional network for image compression artifact suppression with hierarchical skip connections and a multi-scale loss function. We achieve a boost of up to 1.79 dB in PSNR over ordinary JPEG and an improvement of up to 0.36 dB over the best previous ConvNet result. We show that a network trained for a specific quality factor (QF) is resilient to the QF used to compress the input image - a single network trained for QF 60 provides a PSNR gain of more than 1.5 dB over the wide QF range from 40 to 76.
This article presents a novel segmentation algorithm that allows the automatic segmentation of masonry blocks from a 3D point cloud acquired with LiDAR technology, for both stationary and mobile devices. The point clo...
详细信息
This article presents a novel segmentation algorithm that allows the automatic segmentation of masonry blocks from a 3D point cloud acquired with LiDAR technology, for both stationary and mobile devices. The point cloud segmentation algorithm is based on a 2.5D approach that creates images based on the intensity attribute of LiDAR systems. imageprocessingalgorithms based on an improvement of the marked-controlled watershed was successfully used to produce the automatic segmentation of the point cloud in the 3D space isolating each individual stone block. Finally, morphologic analysis in two case studies has been carried out. The morphologic analysis provides information about the assemblage of masonry pieces, which is valuable for the structural evaluation of masonry buildings.
Traffic congestion remains a serious problem in transportation networks. Widely used navigation systems can only react to the presence of traffic jams but not to prevent their creation. One of the possibilities to pre...
Traffic congestion remains a serious problem in transportation networks. Widely used navigation systems can only react to the presence of traffic jams but not to prevent their creation. One of the possibilities to prevent congestion is to manage road traffic within the urban area. This work considers a route reservation approach with possibility to reroute a vehicle during a journey. This approach decomposes road segments into time-spatial slots and for every vehicle it makes the slots reservation for the corresponding route. Since the travel time in real networks cannot be determined precisely and can be considered as stochastic, we propose to use a rerouting procedure to minimize the traveling time. The experiments are carried out in microscopic simulation of a real-world traffic environment in the transportation network of Samara, Russia, using multi-agent transport simulation MATSim.
暂无评论