An image in general consists of a combination of significant objects in the foreground and not-so-significant objects in the background. Content aware image resizing or seam carving is a process of resizing an image w...
详细信息
ISBN:
(纸本)9781538636251
An image in general consists of a combination of significant objects in the foreground and not-so-significant objects in the background. Content aware image resizing or seam carving is a process of resizing an image while maintaining the significant objects (the foreground) in proper visual saliency. The standard algorithms, however, often generate unpredictable distortions in images with densely situated foreground objects. The optimized content aware image resizing (OCAIR) algorithm presented herein, uses iterative graph cuts and edge detection to generate an energy map based on the important sections of the image, so that the resized image does not exhibit unpredictable artefacts. An improved energy map generation algorithm is designed here, which not only marks out the important foreground elements quicker than previously available techniques, but also uses that information to quantity the amount of distortion (if any) that might take place after adding or deleting seams by means of calculating a distortion factor. The process being considerably faster than previous algorithms, allows precise modifications to the input parameters to obtain a well-doctored final image.
Content-based image representation is a very challenging task if we restrict to their visual content. However, associated metadata (such as tags or geolocation) become a valuable source of complementary information th...
详细信息
Content-based image representation is a very challenging task if we restrict to their visual content. However, associated metadata (such as tags or geolocation) become a valuable source of complementary information that may help to enhance the current system performance. In this paper, we propose an automatic training framework that uses both imagevisual contents and metadata to fine tune deep Convolutional Neural Networks (CNNs), providing better image descriptors adapted to certain locations, such as cities or regions. Specifically, we propose to estimate some weak labels by combining visual- and location-related information and incorporate them to a novel loss-function over pairs of images. Our experiments on a landmark discovery task show that this novel training procedure enhances the performance up to a 55% over well-established CNN-based models and is free from overfitting.
In this work, a novel bit allocation method based on visual attention and distortion sensitivity is developed for JPEG2000. Although, visual attention map for an image can be measured by using well-known saliency map ...
详细信息
ISBN:
(纸本)9781509064946
In this work, a novel bit allocation method based on visual attention and distortion sensitivity is developed for JPEG2000. Although, visual attention map for an image can be measured by using well-known saliency map methods, true visual attention map can be obtained by conducting experiments to determine fixation points and their durations. A perception model might turn these duration of fixations into visual attention levels. Besides visual attention, visual distortion sensitivity may guide the bit allocation process effectively. This is because human visual system is more sensitive to the distortion around the edges than the distortion in the complex textured areas. In this work, a novel visual distortion sensitivity method that considers all edges without using a threshold for gradient magnitude and uses local entropy of gradient orientation distribution is proposed. Thus, the visual attention and the distortion sensitivity level of each code-block determine its quantization parameters. Using bit allocation based on the visual attention map provides higher subjective evaluation score than using bit allocation based on the post compression rate-distortion optimization method or on a previously proposed method based on the saliency map. Secondly, it is shown that the use of visual distortion sensitivity allows higher objective evaluation scores to be attained.
In this paper, we present a novel image scaling method that employs a mesh model that explicitly represents discontinuities in the image. Our method effectively addresses the problem of preserving the sharpness of edg...
详细信息
ISBN:
(纸本)9781538607008
In this paper, we present a novel image scaling method that employs a mesh model that explicitly represents discontinuities in the image. Our method effectively addresses the problem of preserving the sharpness of edges, which has always been a challenge, during image enlargement. We use a constrained Delaunay triangulation to generate the model and an approximating function that is continuous everywhere except across the image edges (i.e., discontinuities). The model is then rasterized using a subdivision-based technique. visual comparisons and quantitative measures show that our method can greatly reduce the blurring artifacts that can arise during image enlargement and produce images that look more pleasant to human observers, compared to the well-known bilinear and bicubic methods.
In this paper, a spectral graph based image coding method is proposed. In the proposed method, only the smoothing parameter in the graph adjacency matrix, graph Fourier transform (GFT) coefficients kept after hard thr...
详细信息
ISBN:
(纸本)9781509064946
In this paper, a spectral graph based image coding method is proposed. In the proposed method, only the smoothing parameter in the graph adjacency matrix, graph Fourier transform (GFT) coefficients kept after hard thresholding and column indices of eigenvectors corresponding to kept coefficients are sent to the decoder side. It is advantageous in that this coding process is performed by sending much less information as compared to other graph based methods. Our results indicate that our proposed method enables image compression without deteriorating the visual quality by using very small numbers of GFT coefficients.
Automatic segmentation of colorectal polyps from endoscopic images forms an interesting challenge in computer vision. The method proposed in this paper intends to segment colorectal polyp (abnormal) regions from norma...
详细信息
Automatic segmentation of colorectal polyps from endoscopic images forms an interesting challenge in computer vision. The method proposed in this paper intends to segment colorectal polyp (abnormal) regions from normal regions from a given endoscopic image. Due to lack of any regular texture patterns in this kind of images and apparent visual similarity in background and foreground pixels, conventional texture feature extraction and classification methods do not always yield good results. Hence, active contour based method has been explored to automatically segment out probable abnormal region(s). Our aim is to automatically detect the probable polyp region(s) and then verify the results with respect to the ground truth. Due to lack of very definitive edge criteria along the boundaries of a polyp, we used “active contour without edges” instead of classical active contour.
This paper deals with the visual positioning task in the image-based visual servoing. The controller is based on the discretized model of image Jacobian matrix. The visual servo controller can solve the intractable pr...
详细信息
ISBN:
(数字)9789811052309
ISBN:
(纸本)9789811052309;9789811052293
This paper deals with the visual positioning task in the image-based visual servoing. The controller is based on the discretized model of image Jacobian matrix. The visual servo controller can solve the intractable problems, such as large displacements between the initial and the desired pose of the camera. This controller can achieve a smooth and linear image trajectory in the image space. The developed IBVS controller also enhances the camera trajectory in 3-D space with time-varying depth. The proposed line feature extraction method is robust to the image noises, the dim light condition and the shadows. The simulation test is performed to validate the effectiveness of the proposed image-based visual controller and line feature extraction method.
Halftoning is essentially required to print image efficiently. The objective of this vital stage of printing imageprocessing pipeline, is to reduce the colour levels of an image without losing its original perception...
详细信息
Halftoning is essentially required to print image efficiently. The objective of this vital stage of printing imageprocessing pipeline, is to reduce the colour levels of an image without losing its original perception. It is also used for efficient storage and transmission purpose. Fundamental techniques of halftoing are as: (1) Dithering, (2) Error Diffusion, (3) Iterative algorithm. In error diffusion technique, it is a challenge to reduce the error. Proposed algorithm defines a circular filter that considers error of neighbours of a pixel for processing. It pulls the error of the neighbouring pixels, sums it to the pixel and processes the pixel. visual perception of halftone image by proposed algorithm quite satisfactory and in experiment, we demonstrated the performance of six existing algorithms and compared it with proposed one. The mean square error (MSE), root mean square error (RMSE), peak signal-to-noise ratio (PSNR), and structure similarity index map (SSIM) of output image are 1.1535e+04, 107.3990, 7.5108, 0.1221 respectively.
One of the main problem of the optical imaging systems is limited depth of field which prevent from obtaining an all-in-focus image of the environment. This paper proposes a novel pixel-based multi-focus image fusion ...
详细信息
ISBN:
(纸本)9781509064946
One of the main problem of the optical imaging systems is limited depth of field which prevent from obtaining an all-in-focus image of the environment. This paper proposes a novel pixel-based multi-focus image fusion method making use of a series of multi-focus images to obtain an all-in-focus image. The proposed method, firstly, generate a focus map over the edges. Then, a full focus map is obtained by propagating the focus values at edge locations to the entire image. The proposed method and the state-of-the-art methods are compared in terms of both quantitative and visual evaluation. Based on the results, the proposed method outperforms the other ones.
visually impaired people face lot of difficulties in their daily life. Many a times they rely on others for help. Several technologies for assistance of visually impaired people have been developed. Among the various ...
详细信息
visually impaired people face lot of difficulties in their daily life. Many a times they rely on others for help. Several technologies for assistance of visually impaired people have been developed. Among the various technologies being utilized to assist the blind, Computer Vision based solutions are emerging as one of the most promising options due to their affordability and accessibility. This paper proposes a system for visually impaired people. The proposed system aims to create a wearable visual aid for visually impaired people in which speech commands are accepted from the user. Its functionality addresses identification of objects and sign boards. This will help the visually impaired person to manage day-to-day activities and to navigate through his/her surroundings. Raspberry Pi is used to implement artificial vision using python language on the Open CV platform.
暂无评论