检索结果-内蒙古大学图书馆

Multimodal remote sensing image registration based on cross-cumulative residual entropy and NSCT

Guangdianzi Jiguang/Journal of Optoelectronics Laser 2013年第12期24卷 2430-2434页

作者： Shi, Yong Jia, Zhen-Hong Qin, Xi-Zhong Yang, Jie Hu, Raphael College of Information Science and Engineering Xinjiang University Urumqi 830046 China Institute of Image Processing and Pattern Recognition Shanghai Jiao Tong University Shanghai 200240 China Knowledge Engineering and Discovery Research Institute Auckland University of Technology Auckland 1020 New Zealand

image registration is widely used in remote sensing image processing. On one hand, non-subsamded Contourlet transform (NSCT) has the advantage of decomposing image in a flexible way;on the other hand, cross-cumulative residual entropy (CCRE) is effective in remote sensing image registration. Considering that, we propose a multimodal remote sensing image registration method which is based on cross-cumulative residual entropy and NSCT algorithm. First, the reference image and target image are decomposed with NSCT to obtain low frequency images, and then the cross-cumulative residual entropy of the obtained low frequency images is calculated. Set the cross-cumulative residual entropy as a similarity measurement. Secondly, Newton's method is employed to gain optimal parameters of the affine transformation model. Finally, the image registration is obtained with the optimal parameters. To validate our algorithm, we test two remote sensing images with our method. Simulation results show that the proposed method is able to find the global optimum rapidly and prevent dropping into a local minimum. In general, it is not only a fast and effective multimodal remote sensing image registration algorithm but also the one with high registration accuracy.

关键词： Contourlet transform

来源：评论

学校读者我要写书评

暂无评论

Speed estimation for scene objects using stereo visual odometry methods

Speed estimation for scene objects using stereo visual odome...

引用

IEEE International Conference on Intelligent Computer Communication and processing

作者： Catalin Golban Sergiu Nedevschi Image Processing and Pattern Recognition Group Computer Science Department Technical University of Cluj-Napoca

ISBN: (纸本)9781479914920

This paper proposes a novel method to determine the speed of the surrounding vehicles in traffic scenarios. Relying on the video information obtained from a stereo camera mounted on a moving vehicle, we first determine the vehicle ego motion based on static scene features then we determine the relative motion between objects based on features situated on the moving objects. For robustness to false feature matches everything is plugged into a multi-RANSAC framework. The novelty of the method consist in the fact that the relative motion between the objects can be determined with the same algorithm that was previously used for ego motion estimation, the only difference consisting in the geometric constraints that are imposed to the subset of point features considered for inliers set detection and evaluation. Also, the proposed method does not rely on the fact that objects are detected previously and it does not detect the objects.

关键词： Visual odometry Obstacles speed Speed detection Ego motion Motion estimation Speed estimation

来源：评论

学校读者我要写书评

暂无评论

Semi-automatic tracking of markers in facial palsy

Semi-automatic tracking of markers in facial palsy

引用

21st International Conference on pattern recognition, ICPR 2012

作者： Limbeck, Philip Kropatsch, Walter G. Haxhimusa, Yll Vienna University of Technology Pattern Recognition and Image Processing Group Austria

ISBN: (纸本)9784990644109

We introduce a semi-automatic tracking method that can be utilized for the analysis of facial markers in the medical condition of facial palsy. Tracking of markers will help medical physicians in evaluating this medical condition quantitatively. We use particle filtering to track markers towards measuring distances needed to evaluate the degree of facial palsy. We show that by employing tracking methods, the analysis time is reduced without losing the high accuracy of the results. © 2012 ICPR Org Committee.

关键词： pattern recognition

来源：评论

学校读者我要写书评

暂无评论

Comparative analysis of audio watermarking technique in MDCT domain with other references in spectral domain

Comparative analysis of audio watermarking technique in MDCT...

引用

9th International Multi-Conference on Systems, Signals and Devices, SSD 2012

作者： Bellaaj, Maha Ouni, Kais Research Unit of Signal Processing Image and Pattern Recognition National Engineering School of Tunis University Tunis Manar Tunisia

ISBN: (纸本)9781467315906

In this paper we propose a comparative review between the proposed digital audio watermarking technique and those achieved by Luigi Rosa and Rolf Brigola. The performed technique operates in the frequency domain. The time-frequency mapping is done using a Modified Discrete Cosine Transform (MDCT). The technique developed by Luigi Rosa operates in the frequency domain but using the Discrete Cosine Transform (DCT) as transformation and that proposed by Rolf Brigola uses the Fast Fourier Transform (FFT). We studied the robustness of each technique against different types of attack and we evaluated the inaudibility by using a statistical approach by calculating the SNR and an objective approach by calculating the ODG notes given by PEAQ. © 2012 IEEE.

关键词： Discrete cosine transforms

来源：评论

学校读者我要写书评

暂无评论

Interactive labeling of image segmentation hierarchies

Interactive labeling of image segmentation hierarchies

引用

Joint 34th Symposium of the German Association for pattern recognition, DAGM 2012 and 36th Symposium of the Austrian Association for pattern recognition, OAGM 2012

作者： Zankl, Georg Haxhimusa, Yll Ion, Adrian Pattern Recognition and Image Processing Group 186/3 Institute for Computer Aided Automation Vienna University of Technology Austria Institute of Science and Technology Austria

ISBN: (纸本)9783642327162

We study the task of interactive semantic labeling of a segmentation hierarchy. To this end we propose a framework interleaving two components: an automatic labeling step, based on a Conditional Random Field whose dependencies are defined by the inclusion tree of the segmentation hierarchy, and an interaction step that integrates incremental input from a human user. Evaluated on two distinct datasets, the proposed interactive approach efficiently integrates human interventions and illustrates the advantages of structured prediction in an interactive framework. © 2012 Springer-Verlag.

关键词： Semantics

来源：评论

学校读者我要写书评

暂无评论

Semi-automatic tracking of markers in facial palsy

Semi-automatic tracking of markers in facial palsy

引用

International Conference on pattern recognition

作者： Philip Limbeck Walter G. Kropatsch Yll Haxhimusa Pattern Recognition and Image Processing Group Vienna University of Technology Austria

关键词： Face Atmospheric measurements Particle measurements Target tracking Accuracy Biomedical imaging Time measurement

来源：评论

学校读者我要写书评

暂无评论

Measurement of Individual Changes in the Performance of Human Stereoscopic Vision for Disparities at the Limits of the Zone of Comfortable Viewing

Measurement of Individual Changes in the Performance of Huma...

引用

International Conference on 3D Imaging, Modeling, processing, Visualization and Transmission (3DIMPVT)

作者： Jan Paulus Georg Michelson Marcus Barkowsky Joachim Hornegger Bjöern Eskofier Michael Schmidt Pattern Recognition Laboratory Institute of Photonic Technologies Erlangen Graduate School in Advanced Optical Technologies (SAOT) University of Erlangen-Nuremberg Germany Department of Ophthalmology Erlangen Graduate School in Advanced Optical Technologies (SAOT) University of Erlangen-Nuremberg Germany Image and video-communication (IVC) research group LUNAM Université IRCCyN UMR CNRS 6597 Polytech Nantes Université de Nantes France Pattern Recognition Laboratory Erlangen Graduate School in Advanced Optical Technologies (SAOT) University of Erlangen-Nuremberg Germany Institute of Photonic Technologies Erlangen Graduate School in Advanced Optical Technologies (SAOT) University of Erlangen-Nuremberg Germany

3D displays enable immersive visual impressions but the impact on the human perception still is not fully understood. Viewing conditions like the convergence-accommodation (C-A) conflict have an unnatural influence on the visual system and might even lead to visual discomfort. As visual perception is individual we assumed the impact of simulated 3D content on the visual system to be as well. In this study we aimed to analyze the stereoscopic visual performance of 17 subjects for disparities inside and outside the in literature defined zone of comfortable viewing to provide an individual evaluation of the impact of increased disparities on the performance of the visual system. Stereoscopic stimuli were presented in a four-alternative forced choice (4AFC) setup in different disparities. The response times as well as the correct decision rates indicated the performance of stereoscopic vision. The results showed that increased disparities lead to a decline in performance. Further, the impact of the presented disparities is dependent on the difficulty of the task. The decline of performance as well as the deciding disparities for the decline were subject dependent.

关键词： Visualization Time factors Three-dimensional displays Educational institutions Stereo image processing Adaptive optics Optical imaging

来源：评论

学校读者我要写书评

暂无评论

Document understanding of graphical content in natively digital PDF documents 12

Document understanding of graphical content in natively digi...

引用

2012 ACM Symposium on Document Engineering, DocEng 2012

作者： Gabdulkhakova, Aysylu Hassan, Tamir Department of Computing Mathematics and Cybernetics Ufa State Aviation Technical University K. Marx str. 12 450000 Ufa Russia Pattern Recognition and Image Processing Group Technische Universität Wien Favoritenstraße 9-11 1040 Wien Austria

ISBN: (纸本)9781450311168

This paper presents an object-based method for analysing the content drawn by graphical operators in natively digital PDF documents. We propose that graphical content in a document can be classified either as structural or nonstructural and present an output model for our analysis result. Heuristic techniques are used to group the instructions into regions and determine their logical role in the document's structure. Experimental results demonstrate the effectiveness of the algorithm. Copyright © 2012 by the Association for Computing Machinery, Inc. (ACM).

关键词： Structural analysis

来源：评论

学校读者我要写书评

暂无评论

Comparative analysis of audio watermarking technique in MDCT domain with other references in spectral domain

Comparative analysis of audio watermarking technique in MDCT...

引用

IEEE SSD International Multi-Conference on Systems, Signals and Devices

作者： Maha Bellaaj Kaïs Ouni Research Unit of Signal Processing Image and Pattern Recognition National Engineering School of Tunis University Tunis El Manar II Tunis Tunisia

ISBN: (纸本)9781467315906

关键词： Robustness Discrete cosine transforms Frequency domain analysis Watermarking Signal to noise ratio Niobium Psychoacoustic models

来源：评论

学校读者我要写书评

暂无评论

Improvements to Uncalibrated Feature-Based Stereo Matching for Document images by Using Text-Line Segmentation

Improvements to Uncalibrated Feature-Based Stereo Matching f...

引用

IAPR International Workshop on Document Analysis Systems, DAS

作者： Muhammad Zeshan Afzal Martin Kramer Syed Saqib Bukhari Faisal Shafait Thomas M. Breuel Image Understanding and Pattern Recognition Group Technical University of Kaiserslautern Kaiserslautern Germany Multimedia Analysis and Data Mining Research Group German Research Center for Artificial Intelligence Kaiserslautern Germany

Document images prove to be a difficult case for standard stereo correspondence approaches. One of the major problem is that document images are highly self-similar. Most algorithms try to tackle this problem by incorporating a global optimization scheme, which tends to be computationally expensive. In this paper, we show that incorporation of layout information into the matching paradigm, as a grouping entity for features, leads to better results in terms of robustness, efficiency, and ultimately in a better 3D model of the captured document, that can be used in various document restoration systems. This can be seen as a divide and conquer approach that partitions the search space into portions given by each grouping entity and then solves each of them independently. As a grouping entity text-lines are preferred over individual character blobs because it is easier to establish correspondences. Text-line extraction works reasonably well on stereo image pairs in the presence of perspective distortions. The proposed approach is highly efficient and matches obtained are more reliable. The claims are backed up by showing their practical applicability through experimental evaluations.

关键词： Feature extraction Robustness Three dimensional displays image segmentation Cameras Solid modeling Stereo vision

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：