ISBN: 0780381858 (print)
Current trends in computer animation, virtual reality, and augmented reality applications have drawn considerable attention to the problem of texture synthesis from the computer graphics and image processing research communities. In this paper we revisit the well-known texture synthesis method, image quilting, first proposed by Efros and Freeman, and propose improvements to its algorithms and implementation. Further, we outline its application within a multi-resolution framework that is particularly suited to modern imaging applications involving compressed image data. We provide experimental results to demonstrate the effectiveness of the proposed modifications and variation.
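As an illustration of the image quilting idea mentioned in this abstract, the sketch below computes the minimum error boundary cut through the overlap between two neighbouring blocks, the dynamic-programming step of the original Efros and Freeman method. It does not reproduce the improvements proposed in the paper, which the abstract does not detail. The function names and the list-of-lists image representation are assumptions made for this example.

```python
def overlap_error(block_a, block_b):
    """Per-pixel squared difference between two overlapping strips (2D lists)."""
    return [[(a - b) ** 2 for a, b in zip(row_a, row_b)]
            for row_a, row_b in zip(block_a, block_b)]


def min_error_vertical_cut(err):
    """Return, for each row, the column of the cheapest top-to-bottom seam.

    err[i][j] is the overlap error at row i, column j. The seam may move
    at most one column left or right between consecutive rows.
    """
    rows, cols = len(err), len(err[0])

    # Forward pass: cost[i][j] = cheapest seam ending at (i, j).
    cost = [list(err[0])]
    for i in range(1, rows):
        prev = cost[-1]
        row = []
        for j in range(cols):
            lo, hi = max(j - 1, 0), min(j + 1, cols - 1)
            row.append(err[i][j] + min(prev[lo:hi + 1]))
        cost.append(row)

    # Backward pass: start at the cheapest end point and trace upwards.
    j = min(range(cols), key=cost[-1].__getitem__)
    seam = [j]
    for i in range(rows - 1, 0, -1):
        lo, hi = max(j - 1, 0), min(j + 1, cols - 1)
        j = min(range(lo, hi + 1), key=cost[i - 1].__getitem__)
        seam.append(j)
    seam.reverse()
    return seam
```

Pixels on one side of the returned seam would be taken from the first block and pixels on the other side from the second, so the visible boundary follows the path where the two blocks agree best.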
ISBN: 0780327268 (print)
More selective methods for applying agricultural herbicides on fields can result in substantial cost savings. Three image processing methods were tested for their ability to identify four different images of plant species. The first two images were different, and the other two were similar. The images were preprocessed by segmentation and spatial filtering using the Color Chromaticity Chart. The test results provide evidence that texture-based methods can provide a useful metric for distinguishing between some species of plants.
ISBN: 0780389123 (print)
Mobile microrobots with piezo slip-stick actuation and more than one degree of freedom mostly do not have internal pose sensors. One possibility for fast, highly accurate pose estimation is to use external visual sensors, such as a video camera combined with a light microscope or a scanning electron microscope (SEM). The use of an SEM in particular places high demands on the image processing. High update rates of the pose data require a short acquisition time for the SEM images. As a result the image noise increases, because frame averaging or averaging of the detector signal is time-consuming. This paper presents a method to calculate the x,y position and the orientation of a micro gripper in a very noisy SEM image stream with cross-correlation in real time. To make real-time pose estimation possible, only a region of interest (ROI) is correlated with the target pattern. The SEM is well suited to working with ROIs, because the scan area of the electron beam can be chosen arbitrarily. At the beginning of the paper, the setup of the mobile-microrobot-based nanohandling station used is described briefly.
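To make the ROI idea concrete, the following sketch searches for a target pattern by zero-mean normalized cross-correlation, but only inside a given region of interest. It is a minimal illustration under stated assumptions, not the method of the paper: it recovers position only (orientation is not handled), the images are plain 2D lists, and the names `ncc` and `locate_in_roi` as well as the `(top, left, height, width)` ROI layout are hypothetical.

```python
import math


def ncc(patch, template):
    """Zero-mean normalized cross-correlation of two equally sized 2D lists."""
    p = [v for row in patch for v in row]
    t = [v for row in template for v in row]
    mean_p, mean_t = sum(p) / len(p), sum(t) / len(t)
    num = sum((a - mean_p) * (b - mean_t) for a, b in zip(p, t))
    den = math.sqrt(sum((a - mean_p) ** 2 for a in p) *
                    sum((b - mean_t) ** 2 for b in t))
    # A flat patch or template has no defined correlation; treat it as 0.
    return num / den if den > 0 else 0.0


def locate_in_roi(image, template, roi):
    """Find the best template match whose window lies fully inside the ROI.

    roi is (top, left, height, width) in image coordinates. Returns
    (row, col, score) for the top-left corner of the best match.
    """
    top, left, height, width = roi
    th, tw = len(template), len(template[0])
    best = (top, left, -2.0)  # NCC scores lie in [-1, 1]
    for r in range(top, top + height - th + 1):
        for c in range(left, left + width - tw + 1):
            patch = [row[c:c + tw] for row in image[r:r + th]]
            score = ncc(patch, template)
            if score > best[2]:
                best = (r, c, score)
    return best
```

Restricting the double loop to the ROI is what keeps the cost low: the number of correlations grows with the ROI area rather than with the full frame size.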
ISBN: 9781457705397 (print)
Perceptual image quality assessment (IQA) and sparse signal representation have recently emerged as high-impact research topics in the field of image processing. Here we make one of the first attempts to incorporate the structural similarity (SSIM) index, a promising IQA measure, into the framework of optimal sparse signal representation and approximation. In particular, we introduce a novel image denoising scheme in which a modified orthogonal matching pursuit algorithm is proposed for finding the best sparse coefficient vector in the maximum-SSIM sense for a given set of linearly independent atoms. Furthermore, a gradient descent algorithm is developed to achieve an SSIM-optimal compromise in combining the input and sparse-dictionary-reconstructed images. Our experimental results show that the proposed method achieves better SSIM performance and provides better visual quality than least-squares-optimal denoising methods.
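For readers unfamiliar with the SSIM index that this abstract builds on, the sketch below evaluates the standard SSIM formula on a single pair of patches given as flat lists of pixel values. It shows only the quality measure, not the modified matching pursuit or the gradient descent step of the paper. The constants `k1 = 0.01` and `k2 = 0.03` are the commonly used defaults; the single-window treatment and the function name are simplifications assumed for this example (SSIM is usually computed over local windows and averaged).

```python
def ssim(x, y, dynamic_range=255.0, k1=0.01, k2=0.03):
    """SSIM index of two equally long pixel lists, computed over one window."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    # Unbiased sample variances and covariance.
    var_x = sum((a - mean_x) ** 2 for a in x) / (n - 1)
    var_y = sum((b - mean_y) ** 2 for b in y) / (n - 1)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / (n - 1)
    # Small constants that stabilise the ratio when the means or variances are near zero.
    c1 = (k1 * dynamic_range) ** 2
    c2 = (k2 * dynamic_range) ** 2
    luminance = (2 * mean_x * mean_y + c1) / (mean_x ** 2 + mean_y ** 2 + c1)
    contrast_structure = (2 * cov + c2) / (var_x + var_y + c2)
    return luminance * contrast_structure
```

Identical patches score 1, while a patch whose structure is inverted relative to the reference scores below 0, which a mean-squared-error measure cannot distinguish from other errors of the same magnitude.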
ISBN: 0819452114 (print)
The aim of this paper is to present a methodology to generate a partition of an image and a hierarchical region merging scheme to improve the meaningfulness of the segmentation by reducing excessive object fragmentation. The segmentation method is based on the watershed transform applied to the image gradient magnitude. Prior to the actual segmentation, the image is smoothed to decrease the amount of detail detected by the watershed transform. To further improve the segmentation result, we use an iterative region merging process that uses a graph to represent the image partitions. In this process the most similar pair of adjacent regions is merged at each step according to a predefined similarity metric. We investigate the use of a combined region merging criterion that takes into account both the intensity similarity and the contrast at the boundary of two adjacent regions. The results obtained illustrate the good combined performance of these segmentation and merging methods and the usefulness of the combined similarity function.
In most interactive applications, an image is processed for human interpretation, and human subjectivity cannot be ignored. A flexible, easily maneuverable, simple programmable S-function is proposed to simulate the visual response characteristic in order to produce the best subjectively evaluated results. A modification to the programmable S-function is also described.
ISBN: 9781665405409 (print)
The performance of image captioning has improved significantly in recent years through deep neural network architectures combined with attention mechanisms and reinforcement learning optimization. The visual relationships and interactions between different objects appearing in the image, however, remain far from fully explored. In this paper, we present a novel approach that combines scene graphs with the Transformer, which we call SGT, to explicitly encode available visual relationships between detected objects. Specifically, we pretrain a scene graph generation model to predict graph representations for images. After that, for each graph node, a Graph Convolutional Network (GCN) is employed to acquire relationship knowledge by aggregating the information of its local neighbors. As we train the captioning model, we feed the potential relation-aware information into the Transformer to generate descriptive sentences. Experiments on the MS (XXX) dataset validate the superiority of our SGT model, which achieves state-of-the-art results on all the standard evaluation metrics.
ISBN: 9781509064946 (print)
One of the aims of image compression is to reduce unnecessary and redundant image data so that images can be stored and transmitted effectively, using less storage capacity and less transmission bandwidth, while the compression should result in the least information loss and the greatest similarity to the original image. Using the Systematic Procedure for Predefined Envelope and Signature Sequences (SYMPES) method, this paper shows the effect of block size on processing parameters such as Mean Square Error (MSE), Peak Signal to Noise Ratio (PSNR), and Compression Ratio (CR), while other parameters, such as image size and the total number of signature, envelope, and coefficient functions, are kept fixed. The effect of block size on quality/distortion metrics is investigated. The study demonstrates the flexibility of the SYMPES method with respect to block size, but only up to the point where coefficient intensity is not lost and signature intensity has no negative effect on envelope intensity. Finally, increasing the block size up to a certain point can result in a higher compression rate and better PSNR. Therefore, a best block size can be found that gives the most desirable result for image compression.
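The quality and distortion metrics named in this abstract can be written down in a few lines. The sketch below uses the standard definitions of MSE and PSNR for 8-bit images; the compression ratio is taken here as uncompressed size divided by compressed size, which is an assumption, since the abstract does not state the definition used with SYMPES. Images are plain 2D lists and all function names are chosen for this example.

```python
import math


def mse(original, reconstructed):
    """Mean squared error between two equally sized 2D lists of pixel values."""
    diffs = [(a - b) ** 2
             for row_o, row_r in zip(original, reconstructed)
             for a, b in zip(row_o, row_r)]
    return sum(diffs) / len(diffs)


def psnr(original, reconstructed, peak=255.0):
    """Peak signal-to-noise ratio in dB; infinite for identical images."""
    err = mse(original, reconstructed)
    if err == 0:
        return math.inf
    return 10.0 * math.log10(peak * peak / err)


def compression_ratio(original_bits, compressed_bits):
    """Assumed definition: size before compression over size after compression."""
    return original_bits / compressed_bits
```

With these definitions, a lower MSE always means a higher PSNR, so the two metrics carry the same information on different scales, while the compression ratio is measured independently of image fidelity.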
ISBN: 9781424462520 (print)
This paper proposes a novel hybrid binarization method that combines global thresholding and local adaptive thresholding for foreground and background segmentation. In the global thresholding stage, sub-block Otsu thresholding is used to achieve better performance, because it can represent local distribution characteristics while the global Otsu method cannot. The original image and the binary image are scanned individually to locate sub-blocks that require further processing. The Sauvola algorithm, implemented with integral images, is then applied for a second binarization. Simulation results on several printform envelopes show that this algorithm matches the performance of the improved Sauvola algorithm while reducing the processing time by 27.4%.
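The integral-image trick mentioned in this abstract can be sketched as follows: two summed-area tables (one of pixel values, one of squared values) give the local mean and standard deviation of any window in constant time, which the Sauvola rule T = m(1 + k(s/R - 1)) then turns into a per-pixel threshold. This is plain Sauvola binarization only, not the hybrid scheme with sub-block Otsu described in the paper. The window size and the values of `k` and `R` are illustrative defaults, and the function names are assumptions.

```python
import math


def integral_image(img, power=1):
    """Summed-area table with a zero border: ii[r][c] sums img[:r][:c] ** power."""
    h, w = len(img), len(img[0])
    ii = [[0] * (w + 1) for _ in range(h + 1)]
    for r in range(h):
        run = 0  # running sum of the current row
        for c in range(w):
            run += img[r][c] ** power
            ii[r + 1][c + 1] = ii[r][c + 1] + run
    return ii


def window_sum(ii, r0, c0, r1, c1):
    """Sum over rows r0..r1-1 and columns c0..c1-1 using four table lookups."""
    return ii[r1][c1] - ii[r0][c1] - ii[r1][c0] + ii[r0][c0]


def sauvola_binarize(img, window=15, k=0.2, R=128.0):
    """Return a 2D list with 1 for background (bright) and 0 for foreground."""
    h, w = len(img), len(img[0])
    sums = integral_image(img, 1)
    squares = integral_image(img, 2)
    half = window // 2
    out = []
    for r in range(h):
        r0, r1 = max(r - half, 0), min(r + half + 1, h)
        row = []
        for c in range(w):
            c0, c1 = max(c - half, 0), min(c + half + 1, w)
            n = (r1 - r0) * (c1 - c0)  # window is clipped at the borders
            mean = window_sum(sums, r0, c0, r1, c1) / n
            var = max(window_sum(squares, r0, c0, r1, c1) / n - mean * mean, 0.0)
            threshold = mean * (1.0 + k * (math.sqrt(var) / R - 1.0))
            row.append(1 if img[r][c] > threshold else 0)
        out.append(row)
    return out
```

Without the integral images, each threshold would require summing every pixel in the window, so the cost per pixel would grow with the window area instead of staying constant.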
ISBN: 9781467350518 (print)
We investigate the effect of stereo camera separation on 3D perception and realism for handheld 3D video communications. Using a handheld device equipped with an autostereoscopic display, a front-facing stereo camera can capture left and right views of the user's face. However, attention must be paid to the camera separation in order to balance viewer comfort and realism. Using display-camera geometry, we illustrate the relationship between real depths in camera space and perceived depths in display space. We then derive the optimal camera separation to capture depths within a user's face that are consistent with the size of the face on a handheld display, and contrast this result with traditional rules of thumb used by stereographers. These recommendations are evaluated in a perceptual user study with a current 3D handheld device.