Conventional skeleton extraction methods require a closed boundary constraint to solve the problem. In natural images closed boundary constraint might not be easily satisfied due to the similarity of the object and it...
详细信息
ISBN:
(纸本)9781509040179
Conventional skeleton extraction methods require a closed boundary constraint to solve the problem. In natural images closed boundary constraint might not be easily satisfied due to the similarity of the object and its background, occlusion, etc. In this paper a novel approach based on the Delaunay triangulation to solve the skeleton extraction in natural images is proposed. The algorithm shows a promising result in extracting the skeleton in natural images.
Consistently, the classical image enlargement algorithm is a scientific analytical method for producing a better refined resolution image that is frequently required for advanced digital imageprocessing (DIP) from a ...
详细信息
ISBN:
(纸本)9781509048090
Consistently, the classical image enlargement algorithm is a scientific analytical method for producing a better refined resolution image that is frequently required for advanced digital imageprocessing (DIP) from a single lower resolution image that is frequently acquired from digital camera embedded system. Because of its less computation calculation, the Single-image Super-Resolution (SISR) that analytically applied for a single lower resolution image is one of the worldwide effective Super Resolution-Reconstruction (SRR) algorithms thus this paper proposes the image enlargement based on the SISR algorithm using high spectrum estimation and Tukey's Biweight constrain function. In general, the performance of this SISR algorithm is hinge on up to three parameters (b, h, k) however there are burdensome for determining these optimized values for these parameters (b, h, k). In order to solve this problem, the Tukey's Biweight constrain function, which is hinge on merely single parameter (T), contrary to three parameters like the classical constrain function, is engaged in the SISR algorithm. By examining on 14 benchmark images, which are profaned by considerable noise forms, in scientific analytical scrutinizing sector, the novel SISR algorithm illustrates that there is efficiently and effortlessly in parameter setting process but the performance of the novel SISR algorithm (with single parameters) is nearly equal to the original SISR (with three parameters). Due to greatly time reduction in the parameter setting process, this novel SISR algorithm is more advisable for real-time applications.
Recent models for learned image compression are based on autoencoders that learn approximately invertible mappings from pixels to a quantized latent representation. The transforms are combined with an entropy model, w...
Recent models for learned image compression are based on autoencoders that learn approximately invertible mappings from pixels to a quantized latent representation. The transforms are combined with an entropy model, which is a prior on the latent representation that can be used with standard arithmetic coding algorithms to generate a compressed bitstream. Recently, hierarchical entropy models were introduced as a way to exploit more structure in the latents than previous fully factorized priors, improving compression performance while maintaining end-to-end optimization. Inspired by the success of autoregressive priors in probabilistic generative models, we examine autoregressive, hierarchical, and combined priors as alternatives, weighing their costs and benefits in the context of image compression. While it is well known that autoregressive models can incur a significant computational penalty, we find that in terms of compression performance, autoregressive and hierarchical priors are complementary and can be combined to exploit the probabilistic structure in the latents better than all previous learned models. The combined model yields state-of-the-art rate–distortion performance and generates smaller files than existing methods: 15.8% rate reductions over the baseline hierarchical model and 59.8%, 35%, and 8.4% savings over JPEG, JPEG2000, and BPG, respectively. To the best of our knowledge, our model is the first learning-based method to outperform the top standard image codec (BPG) on both the PSNR and MS-SSIM distortion metrics.
In this paper, we propose a novel method for lane detection in real-time based on unsupervised learning using images from a camera mounted on the dashboard of a vehicle. Lane detection is a crucial element of an intel...
详细信息
ISBN:
(纸本)9781538615263
In this paper, we propose a novel method for lane detection in real-time based on unsupervised learning using images from a camera mounted on the dashboard of a vehicle. Lane detection is a crucial element of an intelligent vehicle safety system. Lane departure warning systems rely on accurate detection of lanes and are important for a driver assistant system. In our system, we extract the features of interest and detect the lane markings or foreground regions in each frame. We develop a spatio-temporal incremental clustering algorithm coupled with curve fitting for detecting the lanes on-the-fly. The spatio-temporal incremental clustering is performed over the detected foreground region for finding the lanes accurately. Each cluster represents a lane across space and time. The lanes are then identified by fitting curves on these clusters. Our system is capable of accurately detecting straight and curved lanes, noncontinuous and continuous lanes and is independent of number of lanes in the frame and the system is fast because we have not used any database for processing the images. Experimental results show that our algorithm can accurately perform lane detection in challenging scenarios.
A new technique of Mueller-matrix mapping of polycrystalline structure of histological sections of biological tissues is suggested. The algorithms of reconstruction of distribution of parameters of linear and circular...
详细信息
ISBN:
(数字)9781510612501
ISBN:
(纸本)9781510612501;9781510612495
A new technique of Mueller-matrix mapping of polycrystalline structure of histological sections of biological tissues is suggested. The algorithms of reconstruction of distribution of parameters of linear and circular dichroism of histological sections liver tissue of mice with different degrees of severity of diabetes are found. The interconnections between such distributions and parameters of linear and circular dichroism of liver of mice tissue histological sections are defined. The comparative investigations of coordinate distributions of parameters of amplitude anisotropy formed by Liver tissue with varying severity of diabetes (10 days and 24 days) are performed. The values and ranges of change of the statistical (moments of the 1st - 4th order) parameters of coordinate distributions of the value of linear and circular dichroism are defined. The objective criteria of cause of the degree of severity of the diabetes differentiation are determined.
We describe an object replacement approach whereby privacy-sensitive objects in videos are replaced by abstract cartoons taken from clip art. Our approach uses a combination of computer vision, deep learning, and imag...
详细信息
ISBN:
(纸本)9781538607336
We describe an object replacement approach whereby privacy-sensitive objects in videos are replaced by abstract cartoons taken from clip art. Our approach uses a combination of computer vision, deep learning, and imageprocessing techniques to detect objects, abstract details, and replace them with cartoon clip art. We conducted a user study (N=85) to discern the utility and effectiveness of our cartoon replacement technique. The results suggest that our object replacement approach preserves a video's semantic content while improving its privacy by obscuring details of objects.
This paper presents 2D imageprocessing approach to playback detection in automatic speaker verification ( ASV) systems using spectrograms as speech signal representation. Three feature extraction and classification m...
详细信息
ISBN:
(纸本)9781509046881
This paper presents 2D imageprocessing approach to playback detection in automatic speaker verification ( ASV) systems using spectrograms as speech signal representation. Three feature extraction and classification methods: histograms of oriented gradients ( HOG) with support vector machines ( SVM), HAAR wavelets with AdaBoost classifier and deep convolutional neural networks ( CNN) were compared on different data partitions in respect of speakers or playback devices: for instance with different speakers in training and test subsets. The playback detection systems were trained and tested on two speech datasets S-1 and S-2 manufactured independently by two different institutions. The test error for both datasets oscillates about the level of 1% for HOG+SVM and even below it for CNN in bigger S-1 base. In cross validation scenario in which one base was used for training and second base for the test the results were very poor what suggests that the information relevant for playback detection appeared in each base in different way.
Multiplier-free fast algorithms are derived and analyzed for realizing the 8-point discrete sine transform of type II and type vii (DST-II and DST-vii) transforms with applications in image and video compression. A ne...
详细信息
Multiplier-free fast algorithms are derived and analyzed for realizing the 8-point discrete sine transform of type II and type vii (DST-II and DST-vii) transforms with applications in image and video compression. A new fast algorithm is identified using numerical search methods for approximating DST-vii without employing multipliers. In addition, recently proposed fast algorithms for approximating the 8-point DCT-II are now extended to approximate DST-II. All proposed approximations for DST-II and DST-vii are compared with ideal transforms, and circuit complexity is measured using FPGA-based rapid prototypes on a 90nm Xilinx Virtex-4 device. The proposed architectures find applications in emerging video processing standards such as H.265/HEVC.
This paper presents an original framework based on deep learning and preference learning to retrieve and characterize biomedical images for assisting physicians in diagnosing complex diseases with potentially only sma...
详细信息
ISBN:
(纸本)9781509046034
This paper presents an original framework based on deep learning and preference learning to retrieve and characterize biomedical images for assisting physicians in diagnosing complex diseases with potentially only small differences between them. In particular, we use deep learning to extract the high-level and compact features for biomedical images. In contrast to the traditional biomedical algorithms or general image retrieval systems that only consider the use of pixel and/or hand-crafted features to represent images, we utilize deep neural networks for feature discovery of biomedical images. Moreover, in order to be able to index the similarly referenced images, we introduce preference learning in a novel way to learn what kinds of images we need so that we can obtain the similarity ranking list of biomedical images. We evaluate the performance of our system in detailed experiments over the well-known available OASIS-MRI database for whole brain neuroimaging as a benchmark and compare it with those of the traditional biomedical and general image retrieval approaches. Our proposed system exhibits an outstanding retrieval ability and efficiency for biomedical image applications.
Laser spot detection is an important problem in optical measurement. The precision and speed of the detection algorithm influence the optical measurement system directly. The traditional algorithms such as Hough Trans...
详细信息
ISBN:
(纸本)9783319495682;9783319495675
Laser spot detection is an important problem in optical measurement. The precision and speed of the detection algorithm influence the optical measurement system directly. The traditional algorithms such as Hough Transform and Gravity model are unsatisfactory in complex conditions. The laser spot detection algorithm referred in this paper is based on the quaternion discrete cosine transform and moment and a method is adopted to approximate the edge of the laser spot. Not only the center and edge can be detected simultaneously but also the robustness of noise is better than others. The algorithm is suitable for the real-time optical measurement.
暂无评论