检索结果-内蒙古大学图书馆

Co-retrieval: a boosted reranking approach for video retrieval

3rd international conference on image and Video Retrieval (CIVR 2004)

作者： Yan, R Hauptmann, AG Carnegie Mellon Univ Sch Comp Sci Pittsburgh PA 15213 USA

ISBN: (纸本)3540225390

Video retrieval compares multimedia queries to items in a video collection in multiple dimensions and combines all the similarity scores into a final retrieval ranking. Although text is the most reliable feature for video retrieval, features from other modalities can provide complementary information. A reranking framework for video retrieval to augment text feature based retrieval with other evidence is presented. A boosted reranking algorithm called co-retrieval is then introduced, which combines a boosting type learning algorithm and a noisy label prediction scheme to select automatically the most useful (weak) features from multiple modalities. The proposed approach is evaluated with queries and video from the 65 h test collection of the 2003 NIST TRECVID evaluation and it achieves considerable improvement over several baseline retrieval algorithms.

关键词： image retrieval

来源：评论

学校读者我要写书评

暂无评论

Automated detection and identification of persons in video using a coarse 3-D head model and multiple texture maps

Automated detection and identification of persons in video u...

引用

3rd international conference on image and Video Retrieval (CIVR 2004)

作者： Everingham, M Zisserman, A Univ Oxford Dept Engn Sci Visual Geometry Grp Oxford OX1 3PJ England

Progress in the automatic detection and identification of humans in video, given a minimal number of labelled faces as training data, is described. This is an extremely challenging problem owing to the many sources of variation in a person's imaged appearance: pose variation, scale, facial expression, illumination, partial occlusion, motion blur, etc. The method developed in this work combines approaches from computer vision, for detection and pose estimation, with those from machine learning for classification. A 'generative' model of a person's head is defined consisting of a coarse 3-D model and multiple texture maps. This allows faces to be rendered with a variety of facial expressions and at poses differing from those of the training data. It is shown that the identity of a target face can then be determined by first proposing faces with similar pose, and then classifying the target face as one of the proposed faces or not. Furthermore, the texture maps of the model can be automatically updated as new poses and expressions are detected. Results of detecting three characters in a TV situation comedy are demonstrated.

关键词： image analysis

来源：评论

学校读者我要写书评

暂无评论

3-D visualization system of the cranium based on x-ray images

3-D visualization system of the cranium based on x-ray image...

引用

3rd international conference on Medical Information Visualisation - BioMedical Visualisation, MediVis 2005

作者： Pan, Jun-Jun Zhang, Yan-Ning Zhou, Hong Feng, David Dagan School of Computer Science Northwestern Poly-technical University Xi'an China Stomatology Hospital of Xi'an Jiao Tong University Xi'an China Multimedia Computing Research Laboratory Sydney University Australia

ISBN: (纸本)0769523935

A 3-d visualization system of the cranium based on reconstruction from X-rays is presented. Since X-rays belong to the penetrating projection images, the objects do not have definite surface in images. To solve this problem, an approach of pasting the lead granules on the face of a patient and reconstructing the face through the correlated vision is adopted. Then the 3-d cranium model is built by subtracting the thickness of soft tissue from the face model. The whole system consists of image pre-processing, feature point recognition, matching, texture mapping, animation, and 3-d measurement. Only the X-ray machine, adhesive tapes with lead granules and Multrasonograph are needed. The experiment demonstrates that this approach is effective and of high precision. It can help doctors examine and measure the face and cranium of patients by computer. Hopefully, this product could be developed and applied in clinic one day.

关键词： Medical imaging

来源：评论

学校读者我要写书评

暂无评论

Reliable image matching via modified Hausdorff distance with normalized gradient consistency measure

Reliable image matching via modified Hausdorff distance with...

引用

international conference on Information Technology: Research and Education (ITRE)

作者： Chyuan-Huei Thomas Yang Shang-Hong Lai Long-Wen Chang Department of Computer Science National Tsing Hua University HsinChu Taiwan

Reliable image matching is important to many problems in computer vision, image processing and pattern recognition. Hausdorff distance and many of its variations have been employed for image matching with success. In this paper we propose an improved image matching method based on a modified Hausdorff distance with normalized gradient consistency measure. The proposed new image matching algorithm integrates the geometric Hausdorff distance with the photometric intensity gradient information to obtain a better image similarity measure. To show the improvement of the proposed algorithm, we test it with some previous image matching methods on the problem of face recognition under lighting changes. Experimental results show the proposed method produces more accurate face recognition than the previous methods.

关键词： image matching Face recognition Face detection computer science computer vision image processing Pattern recognition image retrieval image databases Spatial databases

来源：评论

学校读者我要写书评

暂无评论

Development of a machine vision system for automotive part inspection - art. no. 60412J

Development of a machine vision system for automotive part i...

引用

3rd international conference on Mechatronics and Information Technology (ICMIT 05)

作者： Andres, NS Marimuthu, RP Eom, YK Jang, BC Andong Natl Univ Dept Mech Engn Andong 760749 South Korea

ISBN: (纸本)0819460737

As an alternative for human inspection, presented in this study was the development of a machine vision inspection system (MVIS) purposely for car seat frames. The proposed MVIS was designed to meet the demands, features and specifications of car seat frame manufacturing companies in striving for increased throughput of better quality. This computer-based MVIS was designed to perforin quality measures by detecting holes, nuts and welding spots on every car seat frame in real time and ensuring these portions are intact, precise and in proper place. In this study, the NI vision Builder software for Automatic Inspection was used as a solution in configuring the aimed quality measurements. The proposed software has measurement techniques such as edge detecting and pattern-matching which are capable of identifying the boundaries or edges of an object and analyzing the pixel values along the profile to detect significant intensity changes. Either of these techniques is capable of gauging sizes, detecting missing portion and checking alignment of parts. The techniques for visual inspection were optimized through qualitative analysis and simulation of human tolerance on inspecting car seat frames. Furthermore, this study exemplified the incorporation of the optimized vision inspection environment to the pre-inspection and post-inspection subsystems. The optimized participation of human on this proposed MVIS for car seat frames has ideally eased to feeding and sorting.

关键词： car seat frame machine vision inspection system (MVIS) vision builder for automatic inspection programmable logic controller (PLC) LabView software image acquisition mechatronics

来源：评论

学校读者我要写书评

暂无评论

Adjunctions in pyramids, curve evolution and scale-spaces

引用

international JOURNAL OF computer vision 2003年第2-3期52卷 139-151页

作者： Keshet, R Heijmans, HJAM Hewlett Packard Labs IL-32000 Haifa Israel Ctr Math & Comp Sci CWI NL-1098 SJ Amsterdam Netherlands

We have been witnessing lately a convergence among mathematical morphology and other nonlinear fields, such as curve evolution, PDE-based geometrical image processing, and scale-spaces. An obvious benefit of such a convergence is a cross-fertilization of concepts and techniques among these fields. The concept of adjunction however, so fundamental in mathematical morphology, is not yet shared by other disciplines. The aim of this paper is to show that other areas in image processing can possibly benefit from the use of adjunctions. In particular, a strong relationship between pyramids and adjunctions is presented. We show how this relationship may help in analyzing existing pyramids, and construct new pyramids. Moreover, it will be explained that adjunctions based on a curve evolution scheme can provide idempotent shape filters. This idea is illustrated in this paper by means of a simple affine-invariant polygonal flow. Finally, the use of adjunctions in scale-space theory is also addressed.

关键词： adjunctions mathematical morphology curve evolution scale-spaces pyramids partially ordered sets image processing

来源：评论

学校读者我要写书评

暂无评论

Watermark embedding mechanism using modulus-based for intellectual property protection on image data

引用

3rd international conference on E-commerce and Web Technology, held in conjunction with the DEXA 02, EC-Web 2002

作者： Wang, Shiuh-Jeng Yang, Kai-Sheng Department of Information Management Central Police University Taoyuan 333 Taiwan Computer Crime Squad Criminal Investigation Bureau National Police Administration Ministry of the Interior Taiwan

ISBN: (纸本)3540441379

In this paper, an intellectual property protection mechanism realized on a watermarking scheme is proposed. The embedding technique we adopted in this paper is based on the modular operation. The modulus is a threshold value which determines how the binary pattern of watermark to be embedded into an image. In order to conduct a better fidelity of the image with the embedded watermark against the perception of human vision system, the random bit-string transformed from the watermark is done first. Then there are two classifications required to perform for the random bit pattern 0/1 during the embedding procedure. Afterwards, we issue several frequent image processing tests, such as JPEG compression, diverse filtering treatments, resampling and requantization, etc., to promote our remarkable result. Comparing to the previous literature in [12] as observed from the experiments, not only the advantages emphasized in [12] are remained, but also the original image is not necessary to check again so as to extract the embedded watermark. Therefore, the scheme explored in this paper is more efficient than the previous scheme, and it is feasible to cope with the protection of intellectual property on the digital image recognition. © Springer-Verlag Berlin Heidelberg 2002.

关键词： image compression

来源：评论

学校读者我要写书评

暂无评论

Geometry motivated variational segmentation for color images 3

引用

3rd international conference on Scale-Space and Morphology in computer vision, Scale-Space 2001

作者： Brook, Alexander Kimmel, Ron Sochen, Nir A. Dept. Of Mathematics Technion Israel Dept. Of Computer Science Technion Israel Dept. Of Applied Mathematics Tel-Aviv University Israel

We propose image enhancement, edge detection, and segmentation models for the multi-channel case, motivated by the philosophy of processing images as surfaces, and generalizing the Mumford-Shah functional. Refer to ht... 详细信息

ISBN: (纸本)9783540423171

关键词： image enhancement

来源：评论

学校读者我要写书评

暂无评论

3rd international Workshop on Energy Minimization Methods in computer vision and Pattern Recognition, EMMCVPR 2001

引用

3rd international Workshop on Energy Minimization Methods in computer vision and Pattern Recognition, EMMCVPR 2001

ISBN: (纸本)3540425233

The proceedings contain 42 papers. The special focus in this conference is on Probabilistic Models and Estimation. The topics include: A double-loop algorithm to minimize the bethe free energy;a variational approach to maximum a posteriori estimation for image denoising;maximum likelihood estimation of the template of a rigid moving object;an application to shape retrieval;a fast MAP algorithm for 3D ultrasound;designing the minimal structure of hidden markov model by bisimulation;relaxing symmetric multiple windows stereo using markov random fields;camera calibration for 3-D surface reconstruction;a hierarchical markov random field model for figure-ground segregation;articulated object tracking via a genetic algorithm;learning matrix space image representations;supervised texture segmentation by maximising conditional likelihood;optimization of paintbrush rendering of images by dynamic MCMC methods;illumination invariant recognition of color texture using correlation and covariance functions;path based pairwise data clustering with application to texture segmentation;a maximum likelihood framework for grouping and segmentation;image labeling and grouping by minimizing linear functionals over cones;grouping with directed relationships;segmentations of spatio-temporal images by spatio-temporal markov random field model;highlight and shading invariant color image segmentation using simulated annealing;edge based probabilistic relaxation for sub-pixel contour extraction;two variational models for multispectral image classification;an experimental comparison of min-cut/max-flow algorithms for energy minimization in vision;a discrete/continuous minimization method in interferometric image processing and a transformation approach.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Multimodality image registration and fusion using neural network

Multimodality image registration and fusion using neural net...

引用

3rd international conference on Information Fusion, FUSION 2000

作者： Mostafa, Mostafa G. Farag, Aly A. Essock, Edward Computer Vision and Image Processing Laboratory Department of Electrical and Computer Engineering University of Louisville Louisville KY 40292 United States Department of Psychology University of Louisville Louisville KY 40292 United States

ISBN: (纸本)2725700000

Multi-modality image registration and fusion are essential steps in building 3D models from remote sensing data. In this paper, we present a neural network technique for the registration and fusion of multi-modality remote sensing data for the reconstruction of 3D models of terrain regions. A feedforward neural network is used to fuse the intensity data sets with the spatial data set after learning its geometry. Results on real data are presented. Human performance evaluation is assessed on several perceptual tests in order to evaluate the fusion results. © 2000 Int. Soc. Inf. Fusion.

关键词： image fusion

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：