Image registration is widely used in remote sensing image processing. On the one hand, the nonsubsampled contourlet transform (NSCT) decomposes an image in a flexible way; on the other hand, cross-cumulative residual entropy (CCRE) is an effective similarity measure for remote sensing image registration. We therefore propose a multimodal remote sensing image registration method based on CCRE and NSCT. First, the reference image and the target image are decomposed with NSCT to obtain low-frequency images, and the CCRE of these low-frequency images is computed and used as the similarity measure. Second, Newton's method is used to find the optimal parameters of the affine transformation model. Finally, the images are registered using the optimal parameters. To validate the algorithm, we test the method on two remote sensing images. Simulation results show that the proposed method finds the global optimum rapidly and avoids falling into a local minimum. Overall, it is a fast and effective multimodal remote sensing image registration algorithm with high registration accuracy.
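As an illustration of the similarity measure, the sketch below computes CCRE for two short intensity sequences. It assumes the commonly used discrete form CCRE(X, Y) = sum over u, v of P(X > u, Y = v) log[P(X > u, Y = v) / (P(X > u) P(Y = v))]; the abstract does not state which form the authors use. The function name `ccre` and the `levels` parameter are hypothetical, and the NSCT decomposition and Newton optimization are not shown.

```python
import math
from collections import Counter


def ccre(x, y, levels):
    """Cross-cumulative residual entropy of two equal-length integer
    intensity sequences; x takes values in [0, levels).

    Assumed discrete form (not taken from the abstract):
    sum_{u,v} P(X>u, Y=v) * log(P(X>u, Y=v) / (P(X>u) * P(Y=v)))
    """
    n = len(x)
    joint = Counter(zip(x, y))   # counts of (x value, y value) pairs
    y_counts = Counter(y)        # counts of y values
    total = 0.0
    for v, count_v in y_counts.items():
        p_v = count_v / n
        for u in range(levels):
            # Joint survival probability P(X > u, Y = v).
            p_joint = sum(c for (a, b), c in joint.items()
                          if b == v and a > u) / n
            # Marginal survival probability P(X > u).
            p_surv = sum(1 for a in x if a > u) / n
            if p_joint > 0:
                total += p_joint * math.log(p_joint / (p_surv * p_v))
    return total


if __name__ == "__main__":
    reference = [0, 1, 2, 3, 3, 2, 1, 0]
    aligned = [10, 11, 12, 13, 13, 12, 11, 10]  # same structure, other modality
    flat = [5] * len(reference)                 # carries no information about reference
    print(ccre(reference, aligned, levels=4))
    print(ccre(reference, flat, levels=4))
```

In a registration loop, a value like this would be evaluated on the low-frequency images for each candidate affine transform, and the optimizer would seek the parameters that maximize it.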
ISBN: 9781479914920 (print)
This paper proposes a novel method to determine the speed of surrounding vehicles in traffic scenarios. Using video from a stereo camera mounted on a moving vehicle, we first determine the vehicle's ego-motion from static scene features, and then determine the relative motion between objects from features located on the moving objects. For robustness to false feature matches, everything is embedded in a multi-RANSAC framework. The novelty of the method is that the relative motion between objects can be determined with the same algorithm previously used for ego-motion estimation; the only difference lies in the geometric constraints imposed on the subset of point features considered for inlier set detection and evaluation. In addition, the proposed method neither requires objects to be detected beforehand nor performs object detection itself.
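The idea of reusing one robust estimator for both ego-motion and object motion can be sketched in a much simplified setting. The example below is an assumption-laden toy, not the authors' multi-RANSAC framework: it estimates a 2D translation instead of a 3D rigid motion, uses hypothetical names (`ransac_translation`, `threshold`), and simply reruns the same estimator on the outliers of the first run to recover a second motion.

```python
import random


def ransac_translation(matches, threshold=0.5, iterations=100, seed=0):
    """Estimate a dominant 2D translation from feature matches.

    matches: non-empty list of ((x0, y0), (x1, y1)) point pairs.
    Returns the mean translation of the largest inlier set and that set.
    """
    rng = random.Random(seed)
    best_inliers = []
    for _ in range(iterations):
        # Minimal sample: one match fully determines a translation.
        (x0, y0), (x1, y1) = rng.choice(matches)
        dx, dy = x1 - x0, y1 - y0
        # Matches whose displacement agrees with the hypothesis are inliers.
        inliers = [m for m in matches
                   if abs(m[1][0] - m[0][0] - dx) <= threshold
                   and abs(m[1][1] - m[0][1] - dy) <= threshold]
        if len(inliers) > len(best_inliers):
            best_inliers = inliers
    n = len(best_inliers)
    tx = sum(b[0] - a[0] for a, b in best_inliers) / n
    ty = sum(b[1] - a[1] for a, b in best_inliers) / n
    return (tx, ty), best_inliers


if __name__ == "__main__":
    # Static scene features share one displacement (stand-in for ego-motion).
    static = [((x, y), (x + 2, y - 1))
              for x, y in [(0, 0), (1, 5), (4, 2), (7, 7),
                           (3, 9), (8, 1), (6, 4), (2, 6)]]
    # Features on a moving object share a different displacement.
    moving = [((x, y), (x + 5, y)) for x, y in [(10, 10), (11, 12), (12, 11)]]
    all_matches = static + moving

    ego, ego_inliers = ransac_translation(all_matches)
    remaining = [m for m in all_matches if m not in ego_inliers]
    obj, _ = ransac_translation(remaining)
    print("dominant motion:", ego)
    print("second motion:", obj)
```

In this toy the second run differs from the first only in which features it is given, which loosely mirrors the abstract's point that the two estimates differ only in the constraints on the candidate feature subset.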
We study the task of interactive semantic labeling of a segmentation hierarchy. To this end we propose a framework interleaving two components: an automatic labeling step, based on a Conditional Random Field whose dep...
We introduce a semi-automatic tracking method that can be used to analyze facial markers in patients with facial palsy. Tracking the markers helps physicians evaluate this medical condition quantitatively. We use particle filtering to track the markers and measure the distances needed to evaluate the degree of facial palsy. We show that tracking reduces the analysis time without sacrificing the high accuracy of the results.
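A minimal bootstrap particle filter gives a feel for how a marker position can be tracked from noisy per-frame measurements. This is a sketch under stated assumptions, not the authors' tracker: the marker is reduced to a 1D coordinate, the motion model is a Gaussian random walk, and the names `track_marker`, `motion_std`, and `meas_std` are hypothetical.

```python
import math
import random


def track_marker(measurements, n_particles=500, motion_std=2.0,
                 meas_std=1.0, seed=0):
    """Track a 1D marker coordinate with a bootstrap particle filter.

    measurements: noisy marker positions, one per frame.
    Returns the weighted-mean position estimate for each frame.
    """
    rng = random.Random(seed)
    # Initialize particles around the first measurement.
    particles = [measurements[0] + rng.gauss(0.0, meas_std)
                 for _ in range(n_particles)]
    estimates = []
    for z in measurements:
        # Predict: diffuse particles with the random-walk motion model.
        particles = [p + rng.gauss(0.0, motion_std) for p in particles]
        # Update: weight each particle by a Gaussian measurement likelihood.
        weights = [math.exp(-0.5 * ((z - p) / meas_std) ** 2)
                   for p in particles]
        total = sum(weights)
        if total == 0.0:
            # All likelihoods underflowed; fall back to uniform weights.
            weights = [1.0 / n_particles] * n_particles
        else:
            weights = [w / total for w in weights]
        estimates.append(sum(w * p for w, p in zip(weights, particles)))
        # Resample: draw particles in proportion to their weights.
        particles = rng.choices(particles, weights=weights, k=n_particles)
    return estimates


if __name__ == "__main__":
    noise = random.Random(1)
    truth = [float(t) for t in range(20)]            # marker drifts 1 unit per frame
    observed = [x + noise.gauss(0.0, 0.3) for x in truth]
    tracked = track_marker(observed)
    print(tracked[-1], truth[-1])
```

Running one such filter per marker and differencing the estimates would yield inter-marker distances of the kind the abstract mentions.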
3D displays enable immersive visual impressions, but their impact on human perception is still not fully understood. Viewing conditions such as the convergence-accommodation (C-A) conflict have an unnatural influence on the visual system and might even lead to visual discomfort. Because visual perception differs between individuals, we assumed that the impact of simulated 3D content on the visual system does as well. In this study we analyzed the stereoscopic visual performance of 17 subjects for disparities inside and outside the zone of comfortable viewing defined in the literature, in order to provide an individual evaluation of the impact of increased disparities on the performance of the visual system. Stereoscopic stimuli were presented at different disparities in a four-alternative forced choice (4AFC) setup. Response times and correct decision rates served as indicators of stereoscopic vision performance. The results showed that increased disparities lead to a decline in performance. Further, the impact of the presented disparities depends on the difficulty of the task. Both the decline in performance and the disparities at which the decline set in were subject dependent.
This paper presents an object-based method for analysing the content drawn by graphical operators in natively digital PDF documents. We propose that graphical content in a document can be classified either as structur...
ISBN: 9781467315906 (print)
In this paper we present a comparative review of the proposed digital audio watermarking technique and those developed by Luigi Rosa and Rolf Brigola. The proposed technique operates in the frequency domain, with the time-frequency mapping performed by a Modified Discrete Cosine Transform (MDCT). The technique developed by Luigi Rosa also operates in the frequency domain but uses the Discrete Cosine Transform (DCT), while the one proposed by Rolf Brigola uses the Fast Fourier Transform (FFT). We studied the robustness of each technique against different types of attack, and we evaluated inaudibility with a statistical approach, by calculating the SNR, and with an objective approach, by calculating the ODG scores given by PEAQ.
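The SNR used as the statistical inaudibility measure can be computed as below. This sketch assumes the standard definition SNR = 10 log10(sum of x^2 / sum of (x - y)^2), with x the original signal and y the watermarked one; the abstract does not spell out the formula, and the function name `snr_db` is hypothetical.

```python
import math


def snr_db(original, watermarked):
    """Signal-to-noise ratio in dB between an original signal and its
    watermarked version, treating the difference as noise."""
    if len(original) != len(watermarked):
        raise ValueError("signals must have the same length")
    signal_power = sum(x * x for x in original)
    noise_power = sum((x - y) ** 2 for x, y in zip(original, watermarked))
    if noise_power == 0:
        return math.inf  # identical signals: no embedding distortion
    return 10.0 * math.log10(signal_power / noise_power)


if __name__ == "__main__":
    host = [1.0, -1.0, 1.0, -1.0]
    marked = [1.1, -0.9, 1.1, -0.9]   # each sample shifted by 0.1
    print(snr_db(host, marked))        # about 20 dB (power ratio of 100)
```

A higher value means the watermark changes the host signal less; perceptual measures such as the ODG from PEAQ complement this because SNR ignores auditory masking.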
Document images are a difficult case for standard stereo correspondence approaches. One of the major problems is that document images are highly self-similar. Most algorithms try to tackle this problem by incorporating a global optimization scheme, which tends to be computationally expensive. In this paper, we show that incorporating layout information into the matching paradigm, as a grouping entity for features, leads to better robustness and efficiency and ultimately to a better 3D model of the captured document, which can be used in various document restoration systems. This can be seen as a divide-and-conquer approach that partitions the search space into portions given by each grouping entity and then solves each of them independently. Text-lines are preferred over individual character blobs as the grouping entity because correspondences between them are easier to establish. Text-line extraction works reasonably well on stereo image pairs in the presence of perspective distortions. The proposed approach is highly efficient, and the matches obtained are more reliable. These claims are supported by experimental evaluations that demonstrate their practical applicability.
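The divide-and-conquer idea can be sketched as follows: features are matched only against candidates in the corresponding text-line, so identical-looking characters on other lines cannot cause false matches. Everything concrete here is an assumption for illustration: text-line correspondence is taken as given (shared `line_id` keys), descriptors are reduced to a single number, and `match_within_lines` is a hypothetical name.

```python
def match_within_lines(left, right):
    """Match features between a stereo pair, one text-line at a time.

    left, right: dict mapping line_id -> list of (x, descriptor) features,
    where descriptor is a scalar stand-in for a real feature descriptor.
    Returns a list of (line_id, x_left, x_right, disparity) tuples.
    """
    matches = []
    for line_id, left_feats in left.items():
        # Search space is restricted to the corresponding text-line.
        candidates = right.get(line_id, [])
        if not candidates:
            continue
        for x_l, d_l in left_feats:
            # Nearest descriptor within the line only.
            x_r, _ = min(candidates, key=lambda f: abs(f[1] - d_l))
            matches.append((line_id, x_l, x_r, x_l - x_r))
    return matches


if __name__ == "__main__":
    # The descriptor 0.2 appears on both lines (self-similar content),
    # yet each feature is matched only within its own line.
    left = {0: [(10, 0.2), (30, 0.8)], 1: [(12, 0.2)]}
    right = {0: [(7, 0.21), (27, 0.79)], 1: [(8, 0.2)]}
    for m in match_within_lines(left, right):
        print(m)
```

Because each line is solved independently, the cost grows with the number of features per line rather than with the number of features in the whole page.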