Computational photography systems are becoming increasingly diverse, while computational resources-for example on mobile platforms-are rapidly increasing. As diverse as these camera systems may be, slightly different ...
详细信息
ISBN:
(纸本)9781450342797
Computational photography systems are becoming increasingly diverse, while computational resources-for example on mobile platforms-are rapidly increasing. As diverse as these camera systems may be, slightly different variants of the underlying imageprocessing tasks, such as demosaicking, deconvolution, denoising, inpainting, image fusion, and alignment, are shared between all of these systems. Formal optimization methods have recently been demonstrated to achieve state-of-the-art quality for many of these applications. Unfortunately, different combinations of natural image priors and optimization algorithms may be optimal for different problems, and implementing and testing each combination is currently a time-consuming and error-prone process. ProxImaL is a domain-specific language and compiler for image optimization problems that makes it easy to experiment with different problem formulations and algorithm choices. The language uses proximal operators as the fundamental building blocks of a variety of linear and nonlinear image formation models and cost functions, advanced image priors, and noise models. The compiler intelligently chooses the best way to translate a problem formulation and choice of optimization algorithm into an efficient solver implementation. In applications to the imageprocessing pipeline, deconvolution in the presence of Poisson-distributed shot noise, and burst denoising, we show that a few lines of ProxImaL code can generate highly efficient solvers that achieve state-of-the-art results. We also show applications to the nonlinear and nonconvex problem of phase retrieval.
image technologies nowadays are used not only for keeping personal events safe, but also are widely applied in conjunction with automated electronic systems. Computer vision is widely used for inspection of the produc...
详细信息
ISBN:
(纸本)9781509018666
image technologies nowadays are used not only for keeping personal events safe, but also are widely applied in conjunction with automated electronic systems. Computer vision is widely used for inspection of the production quality in industries. Food industry is not an exception. Containers for food industry are made in very large quantities. This article contains of defect analysis of both external and side area of the bottleneck. Defects were divided into groups according to which the filters are created. For the control of PET preparation quality an automated computer vision algorithms were developed. The algorithms and methods were used for the detection of defective products mainly based on the image segmentation, digital production, erosion, smoothing. The most effective filters for the defect detection of the workpieces have been determined. It was carried out that efficiency of algorithms are close to 100 %.
Ultra low delay video transmission is becoming increasingly important. Video-based applications with ultra low delay requirements range from teleoperation scenarios such as controlling drones or telesurgery to autonom...
详细信息
ISBN:
(纸本)9781467399616
Ultra low delay video transmission is becoming increasingly important. Video-based applications with ultra low delay requirements range from teleoperation scenarios such as controlling drones or telesurgery to autonomous control of dynamic processes using computer vision algorithms applied on real-time video. To evaluate the performance of the video transmission chain in such systems, it is important to be able to precisely measure the glass-to-glass (G2G) delay of the transmitted video. In this paper, we present a low-complexity system that takes a series of pairwise independent measurements of G2G delay and derives performance metrics such as mean delay or minimum delay etc. from the data. The precision is in the sub-millisecond range, mainly limited by the sampling rate of the measurement system. In our implementation, we achieve a G2G measurement precision of 0.5 milliseconds with a sampling rate of 2kHz.
Symmetric non-negative matrix factorization (SymNMF) has important applications in data analytics problems such as document clustering, community detection and image segmentation. In this paper, we propose a novel non...
详细信息
ISBN:
(纸本)9781509041183
Symmetric non-negative matrix factorization (SymNMF) has important applications in data analytics problems such as document clustering, community detection and image segmentation. In this paper, we propose a novel nonconvex variable splitting method for solving SymNMF. Different from the existing works, we prove that the algorithm converges to the set of Karush-Kuhn-Tucker (KKT) points of the nonconvex SymNMF problem with a global sublinear convergence rate. We also show that the algorithm can be efficiently implemented in a distributed manner. Further, we provide sufficient conditions that guarantee the global and local optimality of the obtained solutions. Extensive numerical results performed on both synthetic and real data sets suggest that the proposed algorithm yields high quality of the solutions and converges quickly to the set of local minimum solutions compared with other algorithms.
image pattern recognition is an important area in digital imageprocessing. An efficient pattern recognition algorithm should be able to provide correct recognition at a reduced computational time. Off late amongst th...
详细信息
ISBN:
(纸本)9781509010660
image pattern recognition is an important area in digital imageprocessing. An efficient pattern recognition algorithm should be able to provide correct recognition at a reduced computational time. Off late amongst the machine learning pattern recognition algorithms, Artificial fish swarm algorithm is one of the swarm intelligence optimization algorithms that works based on population and stochastic search. In order to achieve acceptable result, there are many parameters needs to be adjusted in AFSA. Among these parameters, visual and step are very significant in view of the fact that artificial fish basically move based on these parameters. In standard AFSA, these two parameters remain constant until the algorithm termination. Large values of these parameters increase the capability of algorithm in global search, while small values improve the local search ability of the algorithm. In this paper, we empirically study the performance of the AFSA and different approaches to balance between local and global exploration have been tested based on the adaptive modification of visual and step during algorithm execution. The proposed approaches have been evaluated based on the four well-known benchmark functions. Experimental results show considerable positive impact on the performance of AFSA. A Convex optimization has been integrated into the proposed work to have an ideal segmentation of the input image which is a MR brain image.
The artificial vision is a part of the artificial intelligence that pretends to simulate the human vision, is to say, from the acquisition, processing, analysis and interpretation of images through an intelligent syst...
详细信息
ISBN:
(纸本)9781509050475
The artificial vision is a part of the artificial intelligence that pretends to simulate the human vision, is to say, from the acquisition, processing, analysis and interpretation of images through an intelligent system. This work presents the creation of prototypes under the game jam model as a software product. In this context, the objective of the present work was to apply basic artificial vision algorithms such as linear discriminant analysis (LDA), principal component analysis (PCA), Fisherface, Otsu, CamShift and color spaces such as RGB and HSV in order to be able to motion detection of objects, face recognition and pedestrian detection. As a result of applying this model in rapid prototyping, we found significant factors (such as: participatory design, light construction, product value approach, aesthetics and technology) in the implementation of innovative strategies in creating of prototypes focused on Software development.
Lossy image compression algorithms are pervasively used to reduce the size of images transmitted over the web and recorded on data storage media. However, we pay for their high compression rate with visual artifacts d...
详细信息
ISBN:
(纸本)9781509061839
Lossy image compression algorithms are pervasively used to reduce the size of images transmitted over the web and recorded on data storage media. However, we pay for their high compression rate with visual artifacts degrading the user experience. Deep convolutional neural networks have become a widespread tool to address high-level computer vision tasks very successfully. Recently, they have found their way into the areas of low-level computer vision and imageprocessing to solve regression problems mostly with relatively shallow networks. We present a novel 12-layer deep convolutional network for image compression artifact suppression with hierarchical skip connections and a multi-scale loss function. We achieve a boost of up to 1.79 dB in PSNR over ordinary JPEG and an improvement of up to 0.36 dB over the best previous ConvNet result. We show that a network trained for a specific quality factor (QF) is resilient to the QF used to compress the input image - a single network trained for QF 60 provides a PSNR gain of more than 1.5 dB over the wide QF range from 40 to 76.
Increasing spatial resolution is often required in many applications such as entertainment systems or video surveillance. Apart from using higher resolution sensors, it is also possible to apply super resolution algor...
详细信息
ISBN:
(纸本)9781467399616
Increasing spatial resolution is often required in many applications such as entertainment systems or video surveillance. Apart from using higher resolution sensors, it is also possible to apply super resolution algorithms to realize an increased resolution. Those methods can be divided into approaches that rely on only a single low resolution image or on multiple low resolution video frames. While incorporating more frames into the super-resolution is beneficial for the resolution enhancement in principle, it is also likely to introduce more artifacts from inaccurate motion estimation. To alleviate this problem, various weightings have been proposed in the literature. In this paper, we propose an extended dual weighting scheme for an interpolation-based super-resolution method based on Voronoi tessellation that relies on both a motion confidence weight and a distance weight. Compared to non-weighted super-resolution, the proposed method yields an average gain in luminance PSNR of up to 1.29 dB and 0.61 dB for upscaling factors of 2 and 4, respectively. Visual comparisons substantiate the objective results.
OCR is the most active, interesting evaluation invention of text cum character processing recognition and pattern based image recognition. In present life OCR has been successfully using in finance, legal, banking, he...
详细信息
ISBN:
(纸本)9781467385879
OCR is the most active, interesting evaluation invention of text cum character processing recognition and pattern based image recognition. In present life OCR has been successfully using in finance, legal, banking, health care and home need appliances. The OCR consists the different levels of processing methods like as image Pre Acquisition, Classification, Post-Acquisition, Pre-Level processing, Segmented processing, Post-Level processing, Feature Extraction. The many researchers are proposed various levels of different methodologies and approaches in different versions of languages with help of modern and traditional technologies. This paper expressed the detail study and analysis of various character recognition methods and approaches: in details like as flow and type of approached methodology was used, type of algorithm has built with support of technology has implemented background of the proposed methodology and invention best outcomes flow for the each methodology. This paper and also expressed the main objectives and ideology of various OCR algorithms, like as neural networks algorithm, structural algorithm, support vector algorithm, statistical algorithm, template matching algorithm along with how they classified, identified, rule formed, inferred for recognition of characters and symbols.
In this paper we present a new, publicly available database of color, high resolution images useful in evaluation of various algorithms in the field of video surveillance. The additional data provided with the images ...
详细信息
ISBN:
(纸本)9783319238142;9783319238135
In this paper we present a new, publicly available database of color, high resolution images useful in evaluation of various algorithms in the field of video surveillance. The additional data provided with the images facilitates the evaluation of tracking, recognition and reidentification across sequences of images.
暂无评论